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Novel Compounds 

The present invention relates to recombinant heterochimeric paramyxoviridae 
glycoproteins and their expression in eukaiyotic cells, particularly in Chinese 
5 Hamster Ovary (CHO) cells. The invention further relates to methods for 
constructing and expressing such heterochimeric proteins, intermediates for use 
therein, methods to optimize the codon usage of the nucleic acid sequences which 
encode such heterochimeric proteins and the use of the recombinant proteins as 
vaccines for the prevention of diseases caused by paramyxoviridae pathogens. 

10 

The mumps (MuV), Measles (MV), the parainfluenza typeT (PIV1), type II (PIV2) 
and type III (PIV3) and the respiratory syncytial (RSV) virus belong to the 
paramyxoviridae family. The MuV is classified in the rubulavirus subclass, the MV 
is classified in the Morbillivirus subclass, the parainfluenza viruses (PIV1, PIV2 
15 and PIV3) are classified in the paramyxovirus subclass while the RSV is attached to 
the pneumovirus subclass. 

RSV is the most important cause of viral lower respiratory tract disease in infants 
and children. The fusion (F) and the attachment (G) protein which are both viral 
20 surface glycoproteins appear to be of potential value for the development of a 
vaccine against RSV. 

The fusion protein F of RSV contains 574 amino acid residues; amino acids 1 to 21 
correspond to the signal peptide and residues 525 to 549 to the membrane anchor 
25 domain. The molecule presents five potential sites for glycosylation. The F protein 
is synthesized as a 70 kDa precursor (F 0 ) which undergoes proteolytic maturation to 
yield the F, subunit (48 kDa) and F 2 (23 kDa) linked via disulfide bridges. The 
protein F, when injected into animals, leads to the production of neutralizing 
antibodies and may induce cytotoxic lymphocytes (CTLs). 

30 

The attachment or G protein of RSV contains 298 amino acid residues and is 
heavily glycosylated since half of its molecular mass (90 kDa) is contributed by 
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oligosaccharide side chains, chiefly in the form of O-linked sugars. It has been 
shown that the G protein, when injected into animals, provides protection against 
homologous but not heterologous subgroup virus challenge. This protein is 
extremely variable and there is only a stretch of 13 amino acid residues which is 
5 conserved in all RSV. 

The PIV3 is second to RSV as a major agent of severe viral respiratory tract 
infections in infants. The fusion protein F of PIV3 contains 539 amino acid 
residues; amino acids 1 to 18 correspond to the signal peptide and residues 494 to 

10 516 to the membrane anchor domain. The molecule presents 4 potential sites for 
glycosylation. The F protein is synthesized as a 70 kDa precursor (F 0 ) which 
undergoes proteolytic maturation to yield the F, (56 kDa) and F 2 (14 kDa) subunits 
linked via disulfide bridges. The protein F, when injected into animals, leads to the 
production of neutralizing antibodies. The F protein is involved in cell fusion during 

15 viral infection and carries an hemolysin activity. Used alone for immunization, the 
F protein generates an immune response which is insufficient to confer protection 
against a challenge with the vims. Complete protection is only acquired by 
concomitant immunization with the attachment protein HN, another glycoprotein of 
PIV3. 

20 

The protein HN carries hemagglutinin and neuraminidase activities. It is composed 
of 572 amino acids; its membrane anchor domain occurs in the N-tenninal end of 
the molecule, between amino acid residues 32 and 53. Four potential sites for 
glycosylation have been identified. Injection of protein HN into animals generates 
25 an immune response and neutralizing antibodies. These antibodies however do not 
protect completely against a challenge with the virus. Full protection is obtained 
only by concomitant immunization with the F protein of PIV3. 

The PIV1 virus was initially isolated from young children suffering from disorders 
30 of the lower respiratory tract. Infection with PIV1 causes the majority of cases of 
croup found for all infections caused by paramyxoviruses. Viral transmission of 
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PIV1 is by person to person contact or by aerosol, although the virus does not 
persist in the environment for long. 

Like PIV2 and PIV3, the PIV1 virus has two surface glycoproteins, the fusion 
5 protein (F) and the attachment protein (HN). These two proteins are the priority 
targets for the development of a subunit vaccine, the properties of which would be 
to ensure protection of children from the very first months of life and to prevent 
reinfection, or at least to prevent the serious complications by restricting viral 
development to the upper respiratory tract where the consequences would be benign 
10 (common cold). 

PI V2 also affects very young children and causes the same type of respiratory 
discorders, essentially croup, but of less severity. The PIV2 virus has two surface 
glycoproteins (F and HN), which are potential targets for the development of a 
15 subunit vaccine. 

The measles virus is an extremely contagious agent which establishes itself in the 
epithelial cells of the respiratory tract, the oropharynx or the conjunctiva. The 
infection causes fever, cough, head-cold, conjunctivitis and a characteristic 
20 generalised rash. 



There is no appropriate inactivated vaccine against measles but an effective 
attenuated live vaccine is available and is generally used in combination with the 
attenuated live vaccines against rubella and mumps. This live vaccine protects 

25 against the disease for at least 20 years. The measles virus has two surface 

glycoproteins, which are potential targets for the development of a subunit vaccine. 
The fusion protein (F) is a 550 amino acid long glycosylated molecule and, as for 
the other paramyxovirus, has to undergo proteolitic cleavage to yield F, and F 2 
subunits that are linked via disulfide bridges. This molecule, which carries a 

30 haemolysin activity, generates an immune protective response when injected into 
animals. The attachment protein (H), is a 617 amino acid long glycosylated protein, 
which carries a hemagglutinin activity. This protein leads, when injected into 
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animals, to the production of neutralizing antibodies that are able to inhibit 
hemagglutination. This immune response protects the animal against a viral 
challenge. 



5 The mumps virus is a pathogen causing the contagious infantile illness which 
consists of the inflammation of parotid glands. During the incubation period 
following infection, the virus replicates in the respiratory epithelium then 
disseminates into secretary ducts of the parotid glands. Other glands may become 
infected thereafter and numerous cases of meningitis have been reported. Among 
10 complications related to the infection, encephalitis is a serious one, with a mortality 
rate of about 1%; deafness cases have also been reported. 

A vaccine against mumps is available: it is made of an attenuated live virus, 
produced by culturing infected embryonic chicken cells. The vaccine leads to the 
15 seroconversion in vaccinated individuals and protects against infection in more than 
95 % of seronegative persons. The vaccine thus reduces significantly the frequencies 
of complications. 

In a number of cases, however, viral infection is not detected because the effects 
20 remain subclinical. Young children and aged people are most likely to develop 

complications from mumps infection. In view of the inherent risks related to the use 
of attenuated live vaccines, such as the potentiation of the illness upon natural 
surinfection in vaccinated individuals, it is desirable to improve the safety of the 
vaccine, particularly for the groups at-risk. 

25 

The fusion protein F of mumps virus contains 538 amino acid residues; amino acids 
1 to 26 correspond to the signal peptide and residues 483 to 512 to the membrane 
anchor domain. The molecule presents 7 potential sites for glycosylation. The F 
protein is synthesized as a 65-74 kDa precursor (F 0 ) which undergoes proteolytic 
30 maturation to yield the F, (58-61 kDa) and F 2 (10-16 kDa) subunits linked via 
disulfide bridges. The protein F is involved in cell fusion during viral infection, 
carries an haemolysin activity and plays a role for viral penetration into cells. It 
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does not however cany the antibody dependent cellular cytotoxicity (ADCO as 
observed for another mumps virus glycoprotein, HN. 

The protein HN (molecular weight 74-80 kDa) carries hemagglutinin and 
5 neuraminidase activities which are involved in virus attachment to cells and in the 
disruption of the host cell membranes. Protein HN (attachment protein or 
hemagglutinin-neuraminidase) generates neutralizing antibodies and appears 
important for the development of ADCC. Protein HN is composed of 582 amino 
acids; it carries a N-terminal anchor domain (residues 33 to 52) and 9 potential sites 
10 for glycosylation. 

For the viruses considered above, it appears that concomitant immunization with 
both membrane glycoproteins F and HN, or G in the case of RSV, are required to 
achieve full protection in the animal model. Chimeric proteins containing both the F 
15 and G proteins of RSV, or the F and HN proteins of PIV3 have shown complete 
protection against RSV or PIV3 challenge in cotton rats (Brideau et al, J Gen Virol, 
1989, 70 2637-2644 and Brideau et al, J Gen Virol, 1993, 74, 471-477). 

W093 14207 (Connaught) describes heterochimeric proteins comprising RSV and 
20 PIV3 proteins including F(RSV)xHN(PIV3) and F(PIV3)xG(RSV) hybrids, and 
suggests that such proteins can be expressed from a variety of host cells including 
bacterial, mammalian, insect, yeast and fungal cells. The specific examples 
describe expression in insect Sf9 and High 5 cells and mammalian Vera cells. 
There is no specific disclosure of the use of CHO cells. The use of Sf9 and High 5 
25 cells is also described by Du et al, BIO/TECHNOLOGY 12,1994, 813-818. 

Homa et al (Upjohn), J Gen Virol, 1993, 74, 1995-1999 describes another 
heterochimeric protein, F(RSV)xHN(PIV3) expressed in insect cells using a 
recombinant baculovims. 



30 



Homochimeric paramyxoviridae glycoproteins have also been described by several 
workers:- 
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WO8905823 (Upjohn) describes RSV FxG and GxF hybrids which can be expressed 
from bacterial, yeast, mammalian and insect cells. Example 7 describes the 
expression of an RSV FxG protein from CHO cells although there are no details of 
5 how successful such expression is. 

WO8910405 (Upjohn) describes PIV3 FxHN and HNxF hybrids which can be 
expressed from bacterial, yeast, mammalian and insect cells. Example 6 describes 
the expression of a PIV3 FxHN protein from CHO cells, however no details are 
10 given quantifying the extent of expression and secretion. 

Lehman et al (Upjohn), J Gen Virol, 1993, 74, 459-469 describes the expression of 
PIV3 FxHN in insect cells using recombinant baculovirus vectors as well as in CHO 



15 



cells. 



WO9306218 (SmithKline Beecham Biologicals) describes PIV3 FxHN hybrids 
which can be expressed in eukaryotic cells including vaccinia, CHO or Vero cells. 
Example B)2 describes the expression of a Fs + a xHNa hybrid in CHO cells and 
indicates that the product was almost evenly distributed between cells and medium. 
20 No details are however given quantifying the extent of expression and secretion. 

WO9425600 (SmithKline Beecham Biologicals) describes MuV FxHN and HNxF 
hybrids which can be expressed in vaccinia, a mammalian cell (such as CHO) or a 
bacterial cell. Examples B) 3 and 4 describe the expression of s + FHNa"xFa' and 
25 Fs + a xHNa - in CHO cells however no details are given describing the extent of 
expression and secretion. 

Although this cited art may suggest that homochimeric paramyxoviridae 
glycoproteins can be expressed in a variety of cell lines including CHO cells it has 
30 now been discovered that in fact expression and secretion from CHO cells is not 
always successful and success cannot be predicted. Thus it has now been 
demonstrated that although a RSV FxG hybrid could be successfully expressed and 
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secreted in CHO cells, analogous homochimeric hybrids from PIV3 and MuV could 
not in fact be expressed in CHO cells in such manner that they could be purified 
from the supernatant in significant quantities. 

5 Surprisingly , it has now been discovered that heterochimeric hybrids can be 
successfully expressed and secreted in both CHO and insect cells. 

Accordingly in a first aspect the present invention provides a process for preparing a 
heterochimeric protein or an immunogenic derivative thereof comprising an 
10 immunogenic fragment of the fusion (F) protein of RSV, HV1, PIV2. PIV3, MV 
or MuV and an immunogenic fragment of the attachment (G, HN or H) protein of 
RSV, PIV1, PIV2, PIV3, MuV or MV which process comprises expressing 
recombinant DNA encoding the heterochimeric protein or immunogenic derivative 
thereof in CHO cells and recovering the protein. 

15 

By heterochimeric protein is meant one that does not contain a fusion or attachment 
protein from the same pathogen. 

This invention also provides novel heterochimeric proteins not previously described 
20 in WO 93 14207 which can be prepared using the process of the present invention. 

Thus, in a second aspect the present invention provides a heterochimeric protein or 
an immunogenic derivative thereof comprising an immunogenic fragment of the 
fusion (F) protein of RSV, PIV1, PIV2, PIV3, MV or MuV and an immunogenic 
25 fragment of the attachment (G, HN or H) protein of RSV, PIV1 , PIV2, PIV3, MuV 
or MV, with the proviso that where one of the immunogenic fragments is derived 
from RSV F, RSV HN or PIV3 F, PIV3 HN, the other of the immunogenic 
fragments is derived from MuV F, MuV HN, MV F, MV H, PIV1 F.PIVl HN, 
PIV2ForPIV2HN. 



30 



By an immunogenic fragment of the fusion (F) protein of RSV, PIV1, PIV2, PIV3, 
MV or MuV is meant a part of the protein which contains at least one antigenic 
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determinant capable of raising an immune response specific to the F protein of 
RSV, PIV1. PIV2, PIV3, MV or MuV respectively. Included within this definiuon 
is the full length F protein, preferably however the immunogenic fragment is 
lacking the membrane anchor domain at its C-terminal end. 

By an immunogenic fragment of the attachment protein (G, HN or H) of RSV, 
PIV1 . PIV2, PIV3, MuV or MV is meant a part of the protein which contains at 
least one antigenic determinant capable of raising an immune response specific to 
the G protein of RSV, to the HN protein of PIV1 , PIV2, PIV3. MuV or the H 
10 protein of MV respectively. Included within this definition is the full length G or 
HN protein, preferably however the immunogenic fragment is lacking the 
signal/anchor domain at its N-terminal end. 

Preferably the heterochimeric protein is linked via an amino acid in the C-terminal 
15 part of the immunogenic fragment of the F protein of RSV, PIV1, PIV2, PIV3, 
MV or MuV to an amino acid in the N-terminal part of the immunogenic fragment 
of the G protein of RSV, the HN protein of PIV1 , PIV2, PIV3, MuV or the H 

protein of MV. 

20 Suitably the heterochimeric protein commences at its N-terrninal end with a signal 
sequence from the F protein of RSV, PIV1, PIV2, PIV3, MV or MuV. 
Conveniently this will be pan of the corresponding immunogenic fragment of the F 
protein of RSV, PIV1, PIV2, PIV3, MV or MuV when this fragment is linked via 
its C-terminal end to the N-terminal end of the immunogenic fragment of the G 

25 protein of RSV, the HN protein of PIV1 . PIV2, PIV3, MuV or the H protein of 
MV. 

Alternative signal sequences may also be employed. For example, the 
heterochimeric protein suitably commences at its N-terminal end with a signal 
30 sequence of tissue plasminogen activator (TP A). 
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In order to enhance the level of expression the heterochimeric protein may further 
comprise a ubiquitin leader sequence which is suitably positioned after any signal 
sequence as hereinbefore described. Preferably the ubiquitin leader sequence is 
linked to the C-tenninal end of the signal sequence of TPA. 

5 

Preferably the ubiquitin leader sequence is derived from yeast, for example as 
described in Ecker et al, J.Biological Chemistry, 1988, 264(13), 7715-7719. 

Suitably a cleavage site is positioned between the C-tenninal end of the ubiquitin 
10 sequence and the N-terminal end of the immunogenic fragment of the F protein of 
RSV, PIV1, PIV2, PIV3, MV or MuV. 

In order to facilitate chromatographic purification the heterochimeric protein 
suitably comprises a polyhistidine tail, for example as described in Hochuli et al, 

15 BIO/TECHNOLOGY, 1988, 1321-1325. The polyhistidine tail preferably 

comprises from 2 to 6 adjacent histidine residues which is suitably attached at the C- 
terminal end of the heterochimeric protein. Preferably a cleavage site is positioned 
between the polyhistidine tail and the C-tenninal end of the immunogenic fragment 
of the G protein of RSV, the HN protein of PIV1, PIV2, PIV3, MuV or the H 

20 protein of MV. 

The cleavage site for the ubiquitin sequence and/or the polyhistidine tail may be 
chemical or enzymatic and preferably is an enterokinase cleavage site, for example 
as described in LaVallie et al, BIO-TECHNOLOGY, 1993, 187-193. 

25 

Following expression and purification, treatment with an enterokinase will cleave 
off any ubiquitin and/or polyhistidine sequence releasing the desired heterochimeric 
protein. 

30 Particular heterochimeric proteins of this invention include: 

the F protein of RSV lacking its membrane domain linked at its C-tenninal end to 
the HN protein of MuV lacking its signal/anchor domain herein referred to as: 
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Fs + a'RSVxHNs'a*MuV, as well as 



Fs + a-PIV3 xHNsa'MuV; 



Fs + a"MuV x Gs a RSV; and 



5 



FsV MuV x HNsaPIV3, and 
immunogenic derivatives thereof. 



The present invention also provides particular heterochimeric proteins which 



include: 



FsVMuVxHsaMV; or 



10 



FsVRSVxHNsaPP/1; or 



FsVRSVxHNs*a*PIV2, and 



imunogenic derivatives thereof. 



The present invention also provides heterochimeric proteins comprising RSV and 
15 PIV3 proteins not specifically disclosed in W093 14207, which advantageously can 
be expressed from CHO cells. 
These are: 



25 The heterochimeric proteins of the present invention are immunogenic. The term 
immunogenic derivative as used herein encompasses any molecule which is a 
heterochimeric polypeptide which is immunologically reactive with antibodies raised 
to the heterochimeric protein of the present invention or parts thereof or with 
antibodies recognising the F protein of RSV, PIV1, PIV2, PIV3, MV or MuV, the 

30 G protein of RSV, the HN protein of PIV1 , PIV2, PIV3, MuV, the H protein of 
MV, the RSV virus, the PIV1 virus, the PIV2 virus, the PIV3 virus, the MV virus 
or the MuV virus, or which, when administered to a human, elicits antibodies 



20 



FsV (1-526) RSV x HNs'a" (70-572) PIV3; 

FsV (1-492) PIV3 x Gsa" (69-298) RSV; 

Fs + a' (1-526) RSV x HNs'a" (70-572) PIV3 bis; 

FsV (1-526) RSV x HNs'a (70-572) PIV3 ent his, and 

sTPA (1-21) UB (1-74) ent Fsa" (24-526) x HN sa(70-572) PIV3, and 

immunogenic derivatives thereof. 
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recognising the F protein of RSV, PIVl f PIV2, PIV3, MV or MuV, the G protein 
of RSV, the HN protein of PIV1 , PIV2, PIV3, MuV, the H protein of MV, the 
RSV virus, the PIV1 virus, the PIV2 virus, the PIV3 virus, the MV virus or the 
MuV virus. In particular immunogenic derivatives which are slightly longer or 

5 shorter than the heterochimeric proteins of the present invention may be used. Such 
derivatives may, for example, be prepared by substitution, addition, or 
rearrangement of amino acids or by chemical modifications thereof including the 
coupling or for enabling the coupling of the heterochimeric proteins to other carrier 
proteins such as tetanus toxoid or Hepatitis B surface antigen. All such substitutions 

10 and modifications are generally well known to those skilled in the art of peptide 
chemistry. 

Immunogenic fragments of the heterochimeric proteins which may be useful in the 
preparation of vaccines may be prepared by expression of the appropriate gene 
15 fragments or by peptide synthesis, for example using the Merrifield synthesis (The 
Peptides, Vol 2., Academic Press, New York, p3). 

In a further aspect of the invention there is provided recombinant DNA encoding 
the heterochimeric protein of the invention. The recombinant DNA of the invention 
20 may form part of a vector, for example a plasmid, especially an expression plasmid 
from which the heterochimeric protein may be expressed. Such vectors also form 
part of the invention, as do host cells into which the vectors have been introduced. 

In order to construct the DNA encoding a heterochimeric protein according to the 
25 invention, cDNA containing the coding sequences of the RSV, PIV1, PIV2, PIV3, 
MV or MuV fusion and attachment proteins and optionally of the ubiquitin, 
polyhistidine and enterokinase cleavage sites may be manipulated using standard 
techniques [see for example Maniatis T. et al Molecular Cloning, Cold Spring 
Harbor Laboratory, Cold Spring Harbor N.Y. (1982)1 as further described 
30 hereinbelow. 
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In another aspect of the invention there is described a process of enhancing the 
protein expression in mammalian cells by optimization of the codon usage of the 
nucleic acids transfected therein. Optimization of the codon usage involves the 
replacement of at least one non-preferred or less preferred codon in a natural gene 
5 encoding a heterochimeric protein by a preferred codon encoding the same amino 
acid. Highly mammalian-expressed genes have C or G at their degenerative 
position (third base in the codon) whereas the RSV or PIV3-prevalent codons have 
A or T. At least one codon, and more prefereably all the codons of the RSV or 
PrV3 protein can be changed to fit at best the human usage, that is, the one (or 
10 ones) that is the most prevalent as shown below. 



Ala: GCC 


Cys: TGC 


His: CAC 


Met: ATG 


Thr: ACC 


Arg: CGC 
AGG 
CGG 


Gin: CAG 


He: ATC 


Phe: TTC 


Tip: TGG 


Asn: AAC 


Glu: GAG 


Leu: CTG 


Pro: CCC 


Tyr: TAC 


Asp: GAC 


Gly: GGC 


Lys: AAG 


Ser: AGC 
TCC 


Val: GTG 



15 Each amino acid encoded by one of these codons are then considered humanised. 
The ratio between the number of humanised codons versus the total number of 
amino acids gives a percentage of humanisation as shown below. 



1) 


F RSV(l-526)ori|irul 




140/526 = 


27% 


2) 


F RSV (l-423)hunamsed 


+ (424-526)original 


403/526 = 


77% 


3) 


F RSV(|.526)huminual 




489/526 = 


93% 


4) 


F RSV(I-526)oriiinil 


+ HN PIV3 (70-572) original 


258/1029 


= 25% 


5) 


F RSVO-526)huiMniKd 


+ HN PIV3 (70-572) original 


528/1029 


= 51% 


6) 


F RSV(l-526)hum»nisa) 


+ HN prv3 (70-572) h""-ni<n1 




96% 



25 
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The invention also provides DNA encoding a heterochimeric protein or 
immunogenic derivative thereof in which the codon usage of one or more nucleic 
acids has been substantially optimised and a process for expressing said DNA in a 
CHO or insect cell. 

5 

There have been a number of reports that have described a substantial amelioration 
of protein expression in mammalian cells after re-engineering the nucleic acid 
sequence of the heterologous protein to fit the codon usage found in highly 
expressed human genes (Haas J., Park E-C. and Seed B., Codon usage limitation in 

10 the expression of HiV-1 envelope glycoprotein, Current Biology, 1996, 6, n°3, 
315-325 ; Kim C. H., Oh Y. and Lee T.H., Codon optimization for high-level 
expression of human erythropoietin (EPO) in mammalian cells, Gene 199, 1997, 
293-301 ; Zolotukhin S., Potter M. Hauswirth W.W. Guy J. and Muzyczka N. A 
Humanized green fluorescent protein cDNA adapted for high level expression in 

15 mammalian cells. J. of Virology, July 1996, 70, n°7, 4646-4654). 

Vectors comprising such DNA, hosts transformed thereby and the truncated or 
hybrid proteins themselves, expressed as described hereinbelow all form part of the 
invention. 

20 

For expression of the proteins of the invention, plasmids may be constructed which 
are suitable either for transfer into vaccinia virus or transfection into CHO cells, 
insect cells or Vero cells. Suitable expression vectors are described hereinbelow. 
Preferably the proteins of the present invention are expressed in CHO or insect 
25 cells. 

For expression in vaccinia a vaccinia transfer plasmid such as pULB 5213 which is 
a derivative of pSCl 1 (Chakrabati et al, Molecular and Cellular Biology 5, 3403 - 
3409, 1985) may be used. In one aspect the protein may be expressed under the 
30 control of the vaccinia P7 5 promoter. 
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For expression in CHO-K1 cells a glutamine synthetase (GS) vector such as pEE14 
may suitably be used so that the protein is expressed under the control of the major 
immediate early promoter of human cytomegalovirus (hCMV-MIE). Alternatively 
a vector which allows the expression of the coding module as a polycistronic 
5 transcript with the neo selection gene may suitably be used. In one preferred aspect 
the coding module is under the control of the Rous Sarcoma Long Terminal Repeat 
(LTR) promoter. 

Preferably the plasmid for expression in CHO-K1 cells carries a GS expression 
10 cassette suitable for gene amplification using methionine sulphoximine (MSX). 
Alternatively the plasmid for expression in CHO-K1 cells carries a DHFR 
expression cassette suitable for gene amplification using methotrexate (MTX). 

Preferably expression of the heterochimeric protein of the present invention is 
15 carried out in the presence of sodium butyrate and/or dimethyl sulphoxide (DMSO) 
which may enhance gene expression. 

For expression in insect cells a shuttle vector such as pAcUWSl or pAcGP67 may 
be used. In one aspect the protein may be expressed under the control of the 
20 baculovinis plO promoter or the polyhedrin promoter. 

The expression system may also be a recombinant live microorganism, such as a virus 
or bacterium. The gene of interest can be inserted into the genome of a live 
recombinant virus or bacterium. Inoculation and in vivo infection with this live vector 

25 will lead to in vivo expression of the antigen and induction of immune responses. 
Viruses and bacteria used for this purpose are for instance: poxviruses (e.g; vaccinia, 
fowlpox, canarypox), alphaviruses (Sindbis virus, Semliki Forest Virus, Venezuelian 
Equine Encephalitis Virus), adenoviruses, adeno-associated virus, picornaviruses 
(poliovirus, rhinovirus), herpesviruses (varicella zoster virus, etc), Listeria, Salmonella, 

30 Shigella, BCG. These viruses and bacteria can be virulent, or attenuated in various 
ways in order to obtain live vaccines. Such live vaccines also form part of the 
invention. 
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In yet another aspect of the invention there is provided a vaccine composition 
comprising a heterochimeric protein or immunogenic derivative thereof according to 
the invention in combination with a phannaceutically acceptable carrier, a protein 
5 according to the invention for use in vaccinating a mammal and the use of a protein 
according to the invention in the preparation of a vaccine. 

Optionally, and advantageously, the vaccine of the present invention is combined 
with other immunogens to afford a polyvalent vaccine. In a preferred embodiment 
10 the heterochimeric protein is combined with other subcomponents of RSV, PIV1, 
PIV2, PIV3, MuV or MV, e.g. the single proteins F, G, HN or H or homochimeric 
proteins such as RSV FxG, PIV3 FxHN or MuV FxHN. 

In a particular aspect the invention further provides a vaccine composition 
15 comprising a protein according to the invention together with a suitable carrier or 
adjuvant. 

Vaccine preparation is generally described in New Trends and Developments in 
Vaccines, edited by Voller et al, University Park Press, Baltimore, Maryland, 
20 U.S.A., 1978. Encapsulation within liposomes is described, for example by 
Fullerton, U.S. Patent 4,235,877. 

In the vaccine of the present invention , an aqueous solution of the protein(s) can be 
used directly. Alternatively, the protein, with or without prior lyophilisation, can 

25 be mixed, absorbed or adsorbed with any of the various known adjuvants. Such 
adjuvants include, but are not limited to, aluminium hydroxide, muramyl dipeptide 
and saponins such as Quil A. Particularly preferred adjuvants are MPL 
(monophosphoryl lipid A) and 3D-MPL (3 deacylated monophosphoiyl lipid A) [US 
patent 4,912,094], optionally formulated with aluminium hudroxide (EP 0 689 454) 

30 or oil in water emulsions (WO 95/17210). A further preferred adjuvant is known as 
QS21 which can be obtained by the method disclosed in US patent 5,057,540. Use 
of 3D-MPL is described by Ribi et al. in Microbiology (1986) Levie et al. feds) 
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Amer. Soc. Microbiol.Wash. D.C., 9-13. Use of Quil A is disclosed by Dalsgaard 
et aL, (1977), Acta Vet Scand, 18, 349. Use of combined 3D-MPL and QS21 is 
described in WO 94/00153 (SmithKline Beecham Biologicals s.a). QS21 may be 
advantageously formulated with cholesterol containing liposomes, wherein 3D-MPL 
5 is present either in solution or incorporated in the membrane, as described in WO 
96/33739. 

As a further exemplary alternative, a heterochimeric protein of the invention or an 
immunogenic fragment thereof can be encapsulated within microparticles such as 

10 liposomes or associated with oil-in- water emulsions. Encapsulation within 
liposomes is described by Fullerton in US patent 4,235,877. In yet another 
exemplary alternative, a heterochimeric protein according to the invention or an 
immunogenic fragment thereof can be conjugated to an immunostimulating 
macromolecule, such as killed Bordetella or a tetanus toxoid. Conjugation of 

15 proteins to macromolecules is disclosed, for example by Likhite in patent 4,372,945 
and Armor et al in US patent 4,474,757. 

The amount of the protein of the present invention present in each vaccine dose is 
selected as an amount which induces an immunoprotective response without 

20 significant, adverse side effects in typical vaccines. Such amount will vary 
depending upon which specific immunogen is employed and whether or not the 
vaccine is adjuvanted. Generally, it is expected that each dose will comprise 
1-lOOOng of protein, preferably 1-200 jig. An optimal amount for a particular 
vaccine can be ascertained by standard studies involving observation of antibody 

25 titres and other responses in subjects. 

The following examples and the attached figures (explained below) illustrate the 
invention. 

30 In the Figures: 

Figure 34A shows the impact of humanisation on the level of expression of FrHNp, 
where: 
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FhHNElO = product expressed by the pEE!4FhHN transfected clone E10; 
FhHNE7 = product expressed by the pEE14FhHN transfected clone E7; 
FHNbis = product expressed by the pEE14FHN transfected clone; 
+but = 2mM Nabutyrate has been added to the cell medium, 3 days before 
5 harvest; 

pEE14 = negative control; 

Fdroso = pruified Fa- (drosophila derived); the standard protein in this ELISA 
assay wherein lul of standard corresponds to lng of product. 
Figure 34B shows humanisation impact on the level of expression of Fr^ICS^, 
10 where the level of expression was determined by ELISA. Fdroso = purified Fa- 
(drosophila derived) that is the standard protein in this ELISA assay, lul of standard 
corresponds to lng of product. 
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Example 1 

In order to vaccinate with a single immunogen, heterochimeric DNA molecules 
5 were constructed combining extracellular domains of the F and the attachment 
protein for each virus. DNA constructs for the PIV3 and MuV have already been 
described in WO9306218 and WO9425600, respectively. The DNA molecule 
combining the extracellular domains of the RSV F and G proteins were constructed 
as described below. 

10 

The DNA pieces were first inserted into the mammalian expression vector based on 
the replicon of the Semliki Forest Virus (pSFVl). This expression system does not 
lead to a stable expression mammalian cell line but, however gives an indication 
whether or not the chimeric protein is expressed and whether the product is 
15 effectively secreted in the culture medium, which is advantageous for the 
purification procedure. 

Stable expression in the culture medium of mammalian cell lines is preferred to 
obtain good quality and quantities of paramyxovirus glycoproteins. All the chimeric 

20 modules have been inserted in the shuttle vector, the pEE14, which integrates in the 
genome of mammalian cells such as CHO-K1. A quite good expression level was 
obtained with the RSV FxG homochimeric recombinant protein, however negligible 
expression was obtained for the FxHN recombinant homochimeric protein of either 
PIV3 or MuV. Expression of heterochimeric proteins was obtained from CHO 

25 cells. 

Thus by constructing heterochimeric DNA molecules combining the extracellular 
domains of the F protein of one virus linked to the extra cellular domain of the HN 
or G protein of another virus and inserting them into the pEE14 vector for CHO 
30 expression it has been possible to raise the expression level of these proteins. These 
proteins may be used to achieve protection against at least two paramyxoviridae 
viruses with a single immunogen. 
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Some of the chimeric molecules have been inserted into the shuttle vectors, 
pAcUWSl and pACGP67, which integrate in the genome of bacterial and 
lepidopteran cells. Surprisingly good expression of heterochimeric proteins was 
5 obtained from insect cells. 

Vector construction 
Preliminary Constructs 

10 

a) Plasmid pNIV2819 

Starting from plasmid pNIV2801, a cDNA clone encoding inter alia the F protein 
of RSV (type RSS-2; received from Dr Pringle, UK) we reconstructed a cDNA 
15 module coding for the F protein lacking the membrane anchor sequence. 

Plasmid pNIV2801 was digested with Pstl in order to recover a 1416 bp DNA piece 
encoding amino acid residues 18 to 489 of the F protein. Synthetic 
oligonucleotides, specifying respectively the sequences for amino acids 1 to 17 and 

20 490 to 526, were used to produce the corresponding cDNA fragments by the 
polymerase chain reaction performed with pNIV2801 DNA as template. The 
primers were designed to generate also unique flanking restriction sites useful for 
subsequent cloning steps. The coding module was assembled, by ligation, from the 
three DNA pieces described above and introduced into the standard cloning vector 

25 pUC19, to create plasmid pNTV28 19. This plasmid encodes the RSV F protein 
carrying its signal sequence but lacking its anchor sequence (figure 1). 

b) Plasmid pNTV 2820 

30 The cDNA module encoding the full length F protein of RSV was constructed as 
follows. Using two synthetic oligonucleotides, the polymerase chain reaction was 
performed with pNIV2801 DNA as template to generate a 273 bp DNA fragment 
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encompassing the sequence coding for aa 490 to aa 574 of the F protein, the stop 
codon and unique restriction sites useful for subsequent cloning steps. This 
fragment was digested with Nsil and EcoRl and substituted for the Nsil-EcoRl DNA 
piece present in the coding module of pNTV2819 (figure 2). The resulting plasmid, 
5 pNIV2820, thus encodes the RSV F protein carrying both signal and membrane 
anchor sequences. 

c) Plasmid pNIV2841 

10 In this construction, the DNA coding for aa 165 to 176 of the G protein of RSV is 
fused to the DNA encoding the RSV Fs + a" protein. This part of the G protein is 
conserved among both subgroups of RSV. 

The starting material, pNrV2819, was digested by Ncol and Smal yielding a 1601 
15 bp fragment. This fragment was subcloned into the Ncol and Mscl sites of 

pNTV103 (a derivative of pULB1221, see European Patent Application No. 186643) 
leading to pNIV2844. This subcloning allowed to place the translation initiation site 
of the F protein in a more favourable context according to the model proposed by 
Kozak (Kozak M, Nature 308, 241-246, 1984). 

20 

A 1605 bp fragment was recovered from pNTV2844 by digestion with Kpnl and Sail 
and introduced by ligation into pUC19 digested with Kpnl and Sail, creating 
pNIV2840. 

25 Two complementary synthetic oligonucleotides specifying the sequence for amino 
acids 165 to 176 of the G protein followed by a stop codon and flanked by Nsil, 
BamHi, EcoRl and Hindlll sites were hybridized. The 55 bp resulting fragment was 
cloned into the pNF/2840 digested by Nsil and Hindlll, thus replacing a 142 bp 
DNA sequence encoding amino acids 491 to 526 of the F protein. The resulting 

30 recombinant plasmid, pNIV2841, thus contains the sequence coding for amino acids 
1 to 490 of the F protein followed by amino acids 165 to 176 of the G protein 
(figure 3). 
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Vector Construction 

I) For transfer into the pSFVl vector 

5 a) The RSV fusion protein lacking the membrane anchor domain fused to the 
MuV hemagglutinin-neuraminidase lacking the signal-anchor domain, (1- 
526) HN^v (60-582). 

Plasmid pNIV2875, a derivative of pNIV2820 which carries the DNA coding for 
10 the F protein of RSV in which the Spel restriction site has been eliminated by site- 
. directed mutagenesis into the pUC19 vector, has been digested by HindlU and 
BspHl, and a 1618 bp fragment has been isolated. Plasmid pNIV3229, a derivative 
of pNTV3215 whose construction has been already described in WO9425600 and 
which carries the DNA coding for the HN protein of MuV into the pUC19 vector, 
15 has been digested with Bbsl and BamHl; a 1580 bp fragment has been isolated. 
Both fragments were linked together by two complementary synthetic BspHl-Bbsl 
oligonucleotides (Fig 4A) restoring the coding sequence of the chimeric molecule 
and were inserted into the BamHl-Hindlll site of the pUC19 vector leading to 
pNIV4102. (Fig4B) After the sequencing of the junction regions, the chimeric 
20 cassette was retrieved from pNIV4102 by a BamHl digestion and was inserted into 
the BamHl site of the pSFVl vector (Liljestrom, P. and Garoff,H. (1991) 
Bio/Technology 9, 1356). The resulting plasmid, pNTV4104, contains into the 
pSFVl vector the sequence coding for amino acids 1 to 526 of the RSV F protein 
followed by amino acids 60 to 582 of the MuV HN protein. (Fig4C) 

25 

b) The RSV fusion protein lacking the membrane anchor domain fused to the 
PIV3 hemagglutinin-neuraminidase lacking the signal-anchor domain, F^ (1- 
526) HN™ (70-572). 

30 Plasmid pIBI-HN , a cDNA clone containing the complete coding sequence of 
protein HN of PIV3 as well as its 3* non coding sequence (received from Dr.K. 
Dimock, University of Ottawa, Canada), has been digested by Asel and BamHl and 
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a 1468 bp fragment has been isolated. Plasmid pNTV2875 (see supra), which carries 
the DNA encoding the F protein of RSV, in which the unique Spel site has been 
eliminated by site-directed mutagenesis, inserted into the pUC19 vector, has been 
digested by BamHl and BspHl, and a 1588 bp fragment has been isolated. Both 
5 fragments were linked together by two complementary synthetic BspHl-Asel 

oligonucleotides (FigSA) and were inserted into the BamHl site of the pUC19 vector 
leading to pNIV4105 or to pNIV4109 (FigSB) depending of the orientation of the 
chimeric module in the vector. After the sequencing of the junction region, the 
chimeric cassette was retrieved by a BamHl digestion from pNTV4109 and inserted 
10 into the BamHl site of the pSFVl vector. The resulting plasmid, pNIV41 10, 

contains, inserted into the pSFVl vector, the sequence coding for amino acids 1 to 
526 of the RSV F protein followed by amino acids 70 to 572 of the PIV3 HN 
protein. (Fig5C) 



15 c) The PIV3 fusion protein lacking the membrane anchor domain fused to the 
RSV attachment protein lacking the signal-anchor domain, (1-492) Grsv 
(69-298). 

Plasmid pNIV3310, described in WO9306218 which carries the DNA coding for 
20 amino acids 1 to 484 of the PIV3 F protein followed by amino acids 87 to 572 of 
the PIV3 HN protein into the pIBI vector, was digested by EcoRI and Bg/II, and a 
1435 bp fragment has been isolated. Plasmid pNTV2850, which carries the RSV G 
protein into the pUC19 vector, has been digested by Maelll and Hindlll, and a 694 
bp fragment has been isolated. Both fragments were then linked together by using 
25 two complementary BgHl-Maelll synthetic linkers (Fig6A) and were inserted into 
the EcdKl-HindlU sites of pUC19 vector leading to pNIV4103 (Fig6B). The 
chimeric module was then retrieved from the pUC19 vector by a BamHhHindlll 
digestion. After treating the protruding ends with the Klenow polymerase, the 
chimeric cassette has been inserted into the Smal site of pSFVl vector. The 
30 resulting plasmid pNrV4106, thus contains the sequence coding for amino acids 1 to 
492 of the F protein of PIV3 followed by amino acids 69 to 298 of the G protein of 
RSV inserted into the pSFVl vector (Fig6C). 
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d) The PIV3 fusion protein lacking the membrane anchor domain linked to the 
MuV hemagglutinin-neuraminidase lacking the signal-anchor domain, (1- 
493) HN MnV (60-582). 

5 

Plasmid pNIV3310 (see supra, FHN„vj in pIBI) was digested by £coRI and BglU. 
and a 1435 bp fragment was isolated. Plasmid pNIV3229 (see supra, HN MuV into 
pUC19) was digested by Bbsl and HindUl, and a 1610 bp fragment was isolated. 
Both fragments were linked together by adding two synthetic complementary linkers 

10 specifying a BgRl and a Bbsl ends (Fig7A) into the pUC19 vector leading to 

pNIV4117 (Fig7B). After sequencing the junction region, the chimeric cassette was 
retrieved from the pUC19 vector by a BamHl digestion and was inserted into the 
BamHl site of the pSFVl vector. The resulting plasmid pNIV4118 encodes, cloned 
in the pSFVl vector, the DNA sequence specifying amino acids 1 to 493 of the 

15 PIV3 fusion protein linked to amino acids 60 to 582 of the MuV HN protein 
(Fig7C). 

e) The MuV fusion protein lacking its membrane anchor domain linked to the 
RSV attachment protein lacking its signal-anchor domain, F MuV (1-482) Grsv 
20 (69-298). 

Plasmid pNIV3221 , described in WO9425600 which carries the sequence encoding 
amino acids 1 to 462 of the MuV fusion protein within the pUC19 vector, has been 
digested with £coRI and BsrFl, and a 771 bp fragment has been purified. Plasmid 

25 pNIV3221 has been also digested with BsrFl and Pstl, and a 628 bp fragment has 
been isolated. Plasmid pNIV2850 (see supra, into the pUC19) has been 
digested with Mae/// and Hind/// and a 694 bp fragment has been isolated. The 
three fragments were linked together; the F^/G^ junction was created by adding 
to the ligation reaction two synthetic complementary oligonucleotide specifying Pstl 

30 and MaeUl sites (Fig8A) , and were inserted into the EcdRl-HindlU sites of the 
pBluescript vector leading to pNIV4113(Fig8B). The chimeric cassette was 
recovered from pNIV41 13 by a Asp718l digestion and, after treating the protruding 
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ends with the Klenow polymerase, was inserted into the Smal site of the pSFVl 
vector. The resulting plasmid, pNTV4114 contains into the pSFVl vector the 
sequence specifying amino acids 1 to 482 of the MuV F protein linked to amino 
acids 69 to 298 of the RSV G protein (Fig8C). 

5 

f) The MuV fusion protein lacking its membrane anchor domain linked to the 
PIV3 hemagglutinin-neuraminidase lacking its signal-anchor domain, F MuV (1- 
482) HNpjvj (54-572). 

10 Plasmid pNIV4113 (see supra, F MuV x G^ v in pBluescript) was digested by Bsal and 
BamHl, a 1469 bp fragment was isolated. Plasmid pNTV3308, described in 
WO9306218 and which carries the DNA sequence specifying amino acids 1 to 31 
followed by amino acids 54 to 572 of the PIV3 HN protein into the pIBI vector, 
was digested by £a?RI and BamHl and a 1569 bp fragment was isolated. Both 

15 fragments were linked together by two synthetic complementary linkers specifying 
Bsal and EcdSl sites (Fig9A) into the BamHl site of pBluescript leading to 
pNIV4115 (Fig9B). The chimeric module was recovered from pNTV4115 by a 
BamHl digestion and was inserted into BamHl site of pSFVl vector. The resulting 
plasmid, pNIV4116, encodes, in the pSFVl vector, the sequence specifying amino 

20 acids 1-482 of the MuV F protein fused to amino acids 54 to 572 of the PIV3 HN 
protein (Fig9C). 

g) The RSV fusion protein lacking its membrane anchor domain linked to the 
RSV attachment protein lacking its signal-anchor domain, Frsv (1-526) Grsv(69- 

25 298). 

Plasmid pNIV2857 (Figl6A), a derivative of pNIV2841 and which contains the 
DNA sequence coding for amino acids 1 to 526 of the RSV fusion protein linked to 
amino acids 69 to 298 of the RSV attachment protein, has been digested by Asp718I 
30 and Hindlll and a 2180 bp fragment has been isolated. After treating the protruding 
extremities with Klenow's polymerase, this fragment has been inserted in the Smal 
site of the pSFVl vector. The resulting plasmid pNIV2870 t contains in the pSFVl 
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vector, the DNA sequence coding for amino acids 1 to 526 of the RSV fusion 
protein linked to amino acids 69 to 298 of the RSV attachment protein (Figl6B). 

H) For transfection into CHO cells 

a) The RSV fusion protein lacking the membrane anchor domain fused to the 
MuV hemagglutinin-neuraminidase lacking the signal-anchor domain, Frsv (1- 
526) HN MuV (60-582). 

Plasmid pNF/4102, (FiglOA, see supra, Frsv x HN MoV into the pUC19 vector) has 
been digested with BamHl, and after treating the protruding ends with the Klenow 
polymerase, the chimeric module has been inserted into the Smal site of the 
glutamine synthetase (GS) vector, pEE14 (Cockett et al t 1990, Bio/Technology 8, 
662-667). The resulting plasmid pEE14 Fs + a* RSV x HN s'a* MuV contains 
sequences coding for amino acids 1 to 526 of the RSV F protein fused to amino 
acids 60 to 582 of the MuV HN protein under the control of the major immediate 
early promoter of the human cytomegalovirus (hCMV-MIE) (Fig 1 0B). 

b) The RSV fusion protein lacking its membrane anchor domain linked to the 
PIV3 hemagglutinin-neuraminidase lacking its signal-anchor domain, F^ (1- 
526) HN PIV 3 (70-572). 

Plasmids pNIV4105 and pNIV4109 (FigllA and B, see supra, F^y x HN PIV3 into 
the pUC19 vector) were digested by EcoHI and Xhol and a 2032 bp as well as a 
1064 bp fragments were isolated. Both fragments were inserted together into the 
EcoW site of pEE14. The resulting plasmid pEE14 Fs + a' RSV x HNs a* PIV3 
contains sequences coding for amino acids 1 to 526 of the RSV F protein fused to 
amino acids 70 to 572 of the PIV3 HN protein under the control of the hCMV 
promoter (FigllC). 
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c) The PIV3 fusion protein lacking the membrane anchor region linked to the 
RSV attachment protein lacking the signal-anchor domain, Fp^ (1-492) G^ 
(69-298). 

5 Plasmid pNIV4103 (Figl2A, see supra, F ?m x G^ into the pUC19 vector) was 
digested by Hindttl and a 2180 bp fragment was isolated. After treating the 
protruding extremities with the Klenow polymerase, the chimeric module was 
inserted into the Smal site of the pEE14 vector. The resulting plasmid, pEE14 Fs + a" 
PIV3 x Gs aRSV, contains, under the control of the hCMV promoter, the sequence 
10 encoding amino acids 1 to 492 of the PIV3 F protein followed by amino acids 69 to 
298 of the RSV G protein (Fig 12B). 

d) The PIV3 fusion protein lacking the membrane anchor domain fused to the 
MuV hemagglutinin-neuraminidase lacking the signal-anchor domain, (1- 

15 493) HN MuV (60-582). 

Plasmid pNIV4117 (FigDA, see supra, F PIV3 HN MuV into the pUC19 vector) was 
digested with Hindlll and a 31 19 bp fragment was isolated and inserted into the 
Hindttl site of the pEE14 vector. The resulting plasmid, pEE14 Fs + a" PIV3 x HNs" 
20 a MuV, contains under the control of the hCMV promoter a sequence encoding 
amino acids 1 to 493 of the PIV3 fusion protein fused to amino acids 60 to 582 of 
the MuV HN protein (Figl3B). 

e) The MuV fusion protein lacking its membrane anchor domain fused to the 
25 RSV attachment protein lacking its signal-anchor domain, F MuV (1-482) 

(69-298). 

Plasmid pNIV4113 (Figl4A, see supra, F MuV G^ into the pBluescript vector) has 
been digested Asp718l, the protruding ends have been treated by the Klenow 
30 polymerase. A 2200 bp fragment has been isolated and inserted into the Smal site of 
pEE14. The resulting plasmid, pEEI4 Fs + a" MuV x GsaRSV, has, under the 
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control of the hCMV promoter, the sequence encoding amino acids 1 to 482 of the 
MuV F protein followed by amino acids 69 to 298 of the RSV G protein (Figl4B). 

f) The MuV fusion protein lacking its membrane anchor domain fused to the 
5 PIV3 hemagglutinin-neuraminidase lacking its signal-anchor domain, F^y (1- 
482) HN PIV 3 (54-572). 

Plasmid pNTV4115 (Figl5A, see supra f F MuV x HN pm into the pBluescript vector) 
has been digested with EcoRl and a 3040 bp fragment has been inserted into the 
10 EcoKl site of the pEE14 vector. The resulting plasmid, pEE14 FsV MuV x HNs*a* 
PIV3, contains, downstream to the hCMV promoter region, a sequence coding for 
amino acids 1 to 482 of the MuV F protein followed by amino acids 54 to 572 of 
the PIV3 HN protein (FiglSB). 

15 g) The RSV fusion protein lacking its membrane anchor domain linked to the 
RSV attachment protein lacking its signal-anchor domain, (1-526) 0^(69- 
298). 

Plasmid pNIV2857 (Figl7A), a derivative of pNIV2841 and which contains the 
20 DNA sequence coding for amino acids 1 to 526 of the RSV fusion protein linked to 
amino acids 69 to 298 of the RSV attachment protein, has been digested by Asp718I 
and Hindlll and a 2180 bp fragment has been isolated. After treating the protruding 
extremities with Klenow's polymerase, this fragment has been inserted the Smalsixt 
of the pEE14vector. The resulting plasmid, pEE14 Fs + a RSV x Gsa' RSV, contains 
25 under the control of the hCMV promoter the DNA sequence coding for amino acids 
1 to 526 of the RSV fusion protein linked to amino acids 69 to 298 of the RSV 
attachment protein (Figl7B). 

h) The original RSV fusion protein lacking the membrane anchor domain 
30 linked to the PIV3 hemagglutin-neuraminidase lacking the signal-anchor 
domain, (1-526) HN PIY3 (70-572) bis. 
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Plasmid pNTV2852, a derivative of pNTV2820 which carries the DNA encoding the 
RSV F protein where the translation initiation site is in a more favourable context 
according to the model proposed by Kozak (Kozak M, Nature 308, 241-246, 1984), 
has been digested BamHI and BspHI, and a 1588 bp fragment has been isolated. 

5 

Plasmid pIBI-HN, a cDNA clone containing the complete coding sequence of the 
HN protein of PIV3 (received from Dr. K. Dimock, University of Ottawa, Canada) 
has been digested by Asel and BamHI and a 1468 bp has been isolated. 

10 Both fragments were linked together by two complementary synthetic BspHI-Asel 
adaptators (Fig 18 A) and were inserted into the BamHI site of the pUC19 vector 
leading to pNIV4120 (Figl8B). 

After the sequencing of the junction region, the chimeric cassette was retrieved by a 
15 BamHI digestion from pNIV4120 and inserted into the BamHI compatible Bell site 
of the pEE14 vector. The resulting plasmid pEE14 FsVRSV x HNs'a PIV3 bis 
contains the sequences coding for amino acids 1 to 526 of the RSV F protein fused 
to amino acids 70 to 572 of the PIV3 HN protein under the control of the hCMV 
promoter (Figl8C). 

20 

This construct differs from the earlier pEE14 Fs + a* RSV x HNs'a' PIV3 construct 
(Il-a) in the F coding region. In FrsvHNpjvj bis, the nucleic acid sequence found in 
F RSV HN P , V3 , ATG GAT CTG (those codons are specifying aa Metl, Asp2 and Leu3) 
and ACC AGT (specifying aa Thr54 and Ser 55) is replaced by the original 
25 sequence of the RSV F protein that is ATG GAG TTG (specifiyng aa Metl , Glu2, 
Leu3) and ACT AGT (specifying Thr54 and Ser55). 

i) The original RSV fusion protein lacking the membrane anchor domain linked 
to the PIV3 hemagglutinin-neuraminidase lacking the signal-anchor domain 
30 with, at the C-terminal part, a polyhistidine tail preceded by the enterokinase 
cleavage site, (1-526) HN PIV3 (70-572) en his 
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Plasmid pIBI-HN, a cDNA containing the PIV3 HN protein coding sequence (see 
supra) has been digested by PstI and Sphl. A 4588 bp fragment has been isolated 
and linked to complementary synthetic Pstl-SphI adaptators (Fig 19 A). 

5 After the sequencing of the junctions as well as the synthetic linkers, the resulting 
plasmid pNIV3340 has been digested by Xhol and BamHI and a 1 121 bp fragment 
has been isolated (Figl9B). 

Plasmid pNTV4120 (see supra) has been digested by Xhol and BamHI and a 2017 bp 
10 fragment has been isolated (Figl9C). 

Both fragments were linked together and inserted into the BamHI compatible Bell 
site of the pEE14 vector. The resulting plasmid pEE14 FRSVs + a" x HNs'a* en his 
contains, under the control of the hCMV promoter, sequences coding for amino 
15 acids 1 to 526 of the RSV fusion protein fused to the amino acids 70-572 of the 
PIV3 HN protein fused to the enterokinase cleavage site,({Asp} x4 Lys) followed 
by a polyhistidine tail ({his}x6) and a stop codon (Figl9D). 



20 j) The signal domain of the tissue plasminogen activator fused to the yeast 
ubiquitin followed by the enterokinase cleavage recognition site and the 
original RSV fusion protein lacking its membrane signal and anchor domains 
linked to the PIV3 hemagglutin-neuraminidase lacking the signal-anchor 
domain, sTPA(l-21) UB(l-74) ent (24-526) HN^ (70-572)bis. 

25 

1) The signal domain of the tissue plasminogen activator fused to the yeast 
ubiquitin. 

A 208 bp fragment corresponding to amino acid 1 to 76 of the ubiquitin protein of 
30 Saccharomyces cerevisiae was isolated by a digestion of pNIV3475 ( a derivative of 
YEPUBSTUALL, a yeast 2 n vector backbone carrying the yeast ubiquitin) with 
BamHI and Xbal (Fig 20A). 
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Plasmid JW4304 (received from J. Mullins, University of Washington, U.S.A) 
which encodes the signal domain of the tissue plasminogen activator (sTPA) was 
digested by Nhel and BamHI and a 51 lSbp was isolated. Both fragments were 
5 linked together using two synthetic complementary Nhel-Xbal adaptators (Fig20B). 
The resulting plasmid pNIV4121 was digested by Hindlll and BamHI. A 330 bp 
fragment was isolated and inserted into the Hindlll and BamHI sites of the 
pBluescript vector. The resulting plasmid pNTV4122 contains the DNA sequence 
specifying the signal domain of the tissue plasminogen activator followed by an 
10 alanine and a serine residue (those two amino acids are known to produce a good 
leader cleavage) fused to the yeast ubiquitin (Fig 20C). 

2) The signal domain of the tissue plasminogen activator linked to the yeast 
ubiquitin followed by the enterokinase cleavage recognition site and amino acid 
15 24 to 55 of the original fusion protein of RSV. 

Plasmid pNIV4122 (Fig 21A, see supra) was digested by AflH and Spel. A 3212 bp 
fragment was isolated and linked to synthetic complementary Aflll-Spel adaptators 
(Fig21B). The entire module was then sequenced. The resulting plasmid pNIV4123 
20 encodes the signal domain of the tissue plasminogen activator linked to the N- 

terminal 74 aa of the yeast ubiquitin followed by the recognition site of enterokinase 
{(Asp)4 Lys} and amino acid 24 to 55 of the original fusion protein of RSV 
(Fig21C). 

25 3) The signal domain of the tissue plasminogen activator linked to the yeast 
ubiquitin followed by the enterokinase cleavage recognition site and the RSV 
fusion protein linked to the PIV3 hemagglutin-neuraminidase lacking their 
membrane domains. 

30 Plasmid pNIV4123 (Fig 22A, see supra) was digested by Hindlll treated by the 
Klenow polymerase and digested by Spel. A 408 bp fragment has been isolated. 
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Plasmid pNTV4120 (Fig 22B, see supra) has been digested by Xbal, treated by the 
Klenow polymerase, and digested by SpeL A 5620 bp fragment has been isolated. 

Both fragment have been linked together to generate pNIV4124 (Fig 22C). 

5 

The entire coding module was retrieved from pNTV4124 by a digestion with Xbal 
and EcoRI and was inserted into the Xbal and EcoRI sites of the pEE14 expression 
vector. The resulting plasmid pEE14 sTPA x UBI x EN x FsaRSV x HNsa"PIV3, 
contains, under the control of the hCMV promoter, the sequence coding for aal-21 
10 of the tissue plasminogen activator followed by an alanine and a serine residue, by 
the 74 N-terminal amino acids of the yeast ubiquitin, by the recognition cleavage 
site of the emerokinase ({Asp}4 Lys), by aa 24-526 of the original RSV fusion 
protein and by aa 70-572 of the hemagglutin-neuraminidase of PIV3. 

15 ID) For transfection into Insect Cells 

a) The original RSV fusion protein lacking the membrane anchor domain 
linked to the PiV3 hemagglutin-neuraminidase lacking the signal-anchor 
domain, (1-526) HN^ (70-572) bis. 

20 

Plasmid pNIV4120 (FIG 23A) was digested by BamHIznd a 3114 bp fragment was 
isolated and inserted into the BamHI site of the baculovirus transfer vector, 
pAcUWSl (PharMingen). The resulting plasmid pNIV4132 (Fig 23B) contains, 
under the control of the polyhedrin promoter, the sequence coding for amino acids 
25 1-526 of the RSV F protein fused to amino acids 70-572 of the PiV3 HN protein. 

b) The baculovirus gp67 signal peptide fused to the original RSV fusion protein 
lacking both membrane signal and anchor domain linked to the PiV3 
hemagglutin-neuraminidase lacking the signal-anchor domain, SGP67FRSV (25- 

30 526) HN KV3 (70-572) bis. 
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Plasmid pNIV4120 (FIG 24A, see supra) was digested by BamHI and Spel and a 
2939 bp fragment was isolated, linked to two complementary synthetic BamHI-Spel 
adaptators and inserted into the BamHI site of the baculovirus transfer vector, 
pAcGP67A (PharMingen). The resulting plasmid pNTV4136 (Fig 24) contains, 
5 under the control of the polyhedrin promoter, the sequence coding for amino acids 
1-38 of the Baculovirus gp67 protein, followed by an Alanine and an Aspartate 
linked to amino acids 25-526 of the RSV F protein fused to amino acids 70-572 of 
the PiV3 HN protein. 

10 Expression in eukaryotic cells 

A) via the pSFVl vector 

The pSFVl vector is based on the Semliki Forest Virus (SFV) replicon. The DNA 
15 of interest is cloned into the pSFVl vector that serves as a template for in vitro 
synthesis of recombinant RNA. The RNA is transfected into mammalian cells such 
as BHK-21 cells. The recombinant RNA in the cells drives its own replication and 
capping resulting in production of heterologous protein. 

20 Plasmids pNIV2870 was digested with Pvul; pNIV4106, pNIV41 10, pNIV41 14, 
pNIV41 16 and pNIV41 18 were digested with Spel prior to RNA transcription. 
After a phenol extraction followed by an ethanol precipitation, 2 /xg of linearized 
DNA was used as a template for RNA production. About 5 /xg RNA was used to 
transfect, by electroporation, about 8 10 6 BHK-21 cells. All experimental 

25 procedures for RNA production and cell transfection are detailed in Liljestrom and 
Garoff (Bio/Technology, 1991, 9, 1356). 

After 24 h to 48 h post-electroporation, cells and spent culture medium have been 
collected for ELISA and radioimmunoprecipitation assays. 
30 a) pNIV4104, HN MuV 
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EUSA were done using mAb 2072 anti-HN MuV (Orvell, 1984, 7. Immunology 
132, 2622-2629) or 20RG45, a goat anti-RSV serum (Fitzgerald, U.S.A.) to coat 
the microtiter plates and a rabbit polyclonal anti-SBL-1 (MuV) serum or mAbl9 
anti-F RSV (G.Taylor, Inst, of Animal Health, Compton Lab., U.K.) as capture 
5 antibody. 

Radioimmunoprecipitation of the 33 S-methionine labelled product was done using 
mAb2072 (Orvell) and products were resolved onto 7.5% SDS-PAGE. 

10 b) pNIV4110, Frsv HNpro 

ELISA were done using anti-RSV goat serum 20RG45 or mAb anti-HNj™ 4830 
(Rydbeck et aU /. Gen. Virol 67, 1531-1542, 1986) to coat microtiter plates and 
mAbl9 anti-F RSV (G.Taylor) or rabbit anti-PIV3 (E.Norrby, Stockholm) serum as 
15 a capture antibody. 

Radioimmunoprecipitation was done using anti-HN PIV3 mAb4830. 

c) pNIV4106, F prv3 Grsv 

20 

ELISA were done using mAb anti-F PIV3 4549 (E.Norrby, Stockholm) or mAb and 
G^ 858-2 (Chemicon, U.S.A.) to coat microtiter plates and a rabbit anti-PIV3 
serum as a capture antibody. 

25 Radioimmunoprecipitation was done using mAb anti-F PIV3 3283 (Behringwerke). 

d) pNIV4118, F PIV 3 HN MuV 

ELISA plates were coated with anti-F PIV3 mAb 1031215 (Norrby) or with mAb 
30 2072 anti-HN MuV (Orvel) and rabbit anti-PIV3 sera or rabbit anti-MuV sera were 
used as capture antibody. 

Immunoprecipitation of labelled product was done using mAb 2072 anti-HN MuV. - 
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e)pNIV4114 t F Mtl vxG RSV 

EUSA plates were coated with anti-F MuV monoclonal 5414 (Orvell) or anti G^ 
5 mAb (Chemicon) and a rabbit anti-SBL-1 serum was used as a capture antibody. 

0 pNIV4116, F MuV x HN^ 

EUSA plates were coated with anti-F MuV mAb 5414 (Orvell) or mAb anti-HN 
10 PIV3 4830 (Norrby) and rabbit anti-SBL-1 serum or a rabbit anti-PIV3 serum as a 
capture antibody. 

g) pNIV2870, F^x Grsv 

15 EUSA were done using 20RG45, a goat anti-RSV serum (Fitzgerald, U.S.A.) to 
coat the microliter plates and mAbl9 anti-F RSV (G.Taylor, Inst, of Animal 
Health, Compton Lab., U.K.) as capture antibody. 



20 B) Expression in CHO cells (stable transfonnants) 

All recombinant plasmids were transfected by calcium phosphate coprecipitation 
into CHO-KI cells, using 20 fig DNA per 1.25 10 6 cells. The CHO-KI cells were 
grown in GMEM-S medium. The GS transfectants were selected by adding 25 fiM 

25 methionine sulfoximine to the culture medium two days after transfection. After ten 
to fourteen days, resistant colonies were picked and transferred into 96 wells plates. 
Each transfonnant was then transferred into 24 wells plates and subsequently to 80 
cm 2 flasks. The GS transformants were assayed for the recombinant products when 
cells reached about 80% confluency. The procedure follows the one described in 

30 Cockett et al (Bio/Technology, 1990, 5, 662-667). 
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ELISA and immunoprecipitation of radiolabelled products were done using the same 
procedures as the ones described above for the pSFVl system. 



Results 
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Expression in Insect cells 

10 a) Expression in lepidopteran cells. 

The vector pAcUWSl is a shuttle vector for bacteria and lepidopteran cells. A 
heterologous protein coding sequence can be inserted downstream the baculoviius 
plO promoter or either downstream the polyhedrin promoter. 

15 

The pAcGP67 vector is a shuttle vector for bacteria and lepidopteran cells that 
contains the gp67 signal sequence upstream a multiple cloning site. A heterologous 
gene can be inserted in one of the cloning site and will be expressed as a gp67 
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signal peptide fusion protein under the control of the polyhedrin promoter. The 
gp67 signal peptide mediates the secretion of the recombinant protein. 

Either pAcUW51 or pAcGP67 recombinant plasmid can be transfected along with 
5 baculovirus linearised DNA into Sf9 cells (Baculogold DNA, PharMingen). This 
leads to the generation of a recombinant baculovirus stock.. The expression of the 
recombinant heterologous protein is obtained by infecting insect cells with the 
recombinant baculovirus 

10 Plasmid pNIV4132 or plasmid pNIV4136 were transfected with baculovirus linearised 
DNA into Sf9 cells. Recombinant baculovirus 3546 (derived from cells transfected by 
pNIV4132) or 5V (derived from cells transfected by pNTV4136) were plaque purified 
and were used to infect Sf9 or High Five™ cells (Invitrogen). 24h to 72 h post- 
infection the cells and the spent culture medium have been collected for EJJSA and 

15 Western blot analysis. 

ELISA were done using anti-RSV goat serum 20RG45 (Fizgerald) to coat microtiter 
plates and mAbl9 anti-F RSV (G.Taylor) as a capture antibody. 

20 Western blots were done using mAbl9 anti-F RSV (G.Taylor) or using anti-RSV 
goat polyclonal serum 20RG45 (Fizgerald). 

The spent medium from cells infected by either baculovirus 3546 or by 5V tested 
positive in ELISA. The level of expression, depending on the host cell line (SF9 or 
25 High Five), multiplicity of infection, medium (fetal calf serum supplemented or 
serum free synthetic medium) was at least ten times higher than the one obtained 
with a recombinant CHO-KI clone obtained by transfection with pEE14 F^ (1- 
526) HN PiV3 (70-572)bis . 

30 In addition, the spent medium of the baculovirus infected cells reacted positively in 
Western blot. A band in the vicinity of 1 lOkDa was present in the immunoblots. These 
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results confirm the secretion of the chimeric Frsv-HNiw into the medium of Sf9 and 
High Five cells infected with the recombinant baculoviruses. 

b) Purification of the recombinant product 

5 

SF9 cells, adapted to serum free medium, were infected with the plaque purified 
recombinant baculovirus V5 or 3546. The cells were grown in suspension in 500ml 
Erlenmeyer flask in SF900II medium (Gibco BRL). The medium from virus infected 
cells were harvested two days post-infection. The soluble FRsv-HNpiw product was 

10 purified from the medium of infected cells by immunoaffinity chromatography using an 
anti-F RSV monoclonal antibody, mAbl9. The anti-F monoclonal antibody was 
coupled to Activated CH Sepharose 4B (Pharmacia) following the manufacturer 
instructions. The immunoaffinity gel was washed 3 times with 10 bed volumes of 
buffer A (20mM phosphate buffer pH 6.4, NaCl 150mM) prior to sample loading. 

15 After 16 hours at 4°C, the gel was washed with buffer A and the chimeric product was 
eluted with lOOmM phosphoric acid. Eluted protein was neutralized immediately with 
one tenth of volume of 1M phosphate buffer pH 7. 

SDS-PAGE of the immunoaffmity-purified F RS v-HN PiV 3 revealed the presence of a 
20 major protein band of about 110 kDa. This protein was visualized by Coomassie 
blue staining of the gel and reacted with the monoclonal antibody anti-F^y (mAbl9) 
or with the polyclonal serum (20RG45) on immunoblots (Fig25). 

c) production of polyclonal antibodies 

25 

In order to obtain specific antibodies, the baculovirus derived Frsv-HNp^ protein, 
purified by immunoaffinity as described above, was used to immunise four BalbC mice 
and two New Zealand white rabbits. Three sub-cutaneous injections of 
20ng/ml/dose/rabbit or 6^g/100^1/dose/mouse were done at three weeks interval. The 
30 sera were collected 3 weeks after the second and the third injection and the antibody 
response was detected using ELISA and Western blots assays. 



-37- 



WO 00/18929 



PCT/EP99/07004 



1) ELISA assays 

a) Mice response 

The antibody response was followed using a goat anti-RSV serum (20RG45, 
Fitzgerald, USA) to coat the microtiter plates and mouse anti-FHN sera as capture 
5 antibody. The antigens used were either the FaRsy-Drosophila or CHO derived, the 
pRsv-HNpivj expressed in baculovirus and the medium of CHO cells transfected by 
the pEE14 was used as a negative control. 

3 our of 4 mice sera collected after the second injection showed some but low 
10 specific response. However, the mice sera collected after the third injection showed 
a high increase in level of specific antibodies. 

b) Rabbit response 

The antibody response was followed using either one of the following ELISA. The 
15 antigens were the same as the one used to detect the mice antibody response. 

Either a goat anti-RSV serum (20RG45, Fitzgerald, USA), either a monoclonal 
antibody directed against the RSV fusion protein (mAbl9, Compton Lab, UK) or a 
monoclonal antobody directed against the PiV3 hemagglutinin-neuraminidase 
(mAb3285, Behring) were used to coat the microtiter plate and the rabbit anti-HN sera 
20 was used as a capture antibody. The first and the second test bleeds generated high 
specific antibodies. 

2) Western blot assays 

Recombinant Fa-RSV CHO-KI ou Drosophila derived, F R sv-HNpiv 3 baculovirus 
25 derived or the CHO-pEE14 spent medium culture were electrophoresed onto a 15% 
SDS-PAGE and transferred onto a nitrocellulose membrane (Amersham). The rabbit 
anti-HN sera as well as the mouse anti-HN sera detected specifically either the F 
protein or the F R sv-HNp iV 3 chimera. 
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Example 2 

i) Optimization of the codon usage of the nucleic acids sequence coding for the 
RSV fusion protein lacking the membrane anchor domain linked to the PiV3 
hemagglutin-neuraminidase lacking the signal-anchor domain, Fr^ (1-526) 
5 HN Pi v3 (70-572) for the expression in mammalian cells. 

A table showing the comparison of the codon usage found in the F^nN^ module 
with the one found in highly expressed human gene can be found in Fig.26. As 
noted, the most prevalent codons found in the FasvHNpjva module have an A or a T 
10 at their third degenerative position, whereas the human prevalent codons have a C 
or a G. For the improvement of the Fr^IIN^ protein expression, the entire coding 
sequence has been re-engineered to fit at best the human codon usage. The re- 
engineered sequence was obtained using synthetic long oligonucleotides, polymerase 
chain reaction (PCR) and conventional cloning procedures. 

15 

Re-engineering of the coding sequence of the Fr^IIN^ module 

The entire synthetic sequence was recovered by joining three PCR fragments (A, B 
and C). The general strategy to obtain each PCR fragment is schematically 
represented in Fig 27. It consists of assembling overlapping long oligonucleotides in 
20 a first round amplification. The resulting full size fragment is further amplified 
using two short primers located on each of its extremities. 

Construction of fragment A 

The first PCR fragment, corresponding to 18 bases encoding restriction sites 
followed by bases 1 to 1269 of the F^HNp^ followed by 8 bases encoding 
restriction sites, was obtained by PCR assembly of 18 overlapping oligonucleotides 
(Fig 28). This fragment has been inserted in the pCRIITOPO cloning vector 
(Invitrogen). After sequencing the fragment, it was retrieved from the pCRIITOPO 
vector by a Xbal and BsrGI digestion and inserted into the corresponding sites of 
pNIV4120. The module corresponding to F^RN^ with bases 1 to 1264 
humanized was then retrieved by an Xbal and EcoRI digestion and inserted into the 
corresponding sites of pEE14 (Fig.29) generating pEE 1 4xF RSV humHN Pi vj . 



25 



30 
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Construction of fragment B 

The second PCR fragment B corresponding to 13 bases encoding unique restriction 
sites followed by bases 1264 to 2136 of FrsyHNrvs was obtained by assembling 10 
oligonucleotides whose sequences can be found in Fig.30. This fragment has been 
5 inserted in the pCRIITOPO vector and sequenced. This fragment has been 
recovered by a BsrGI and Kpnl digestion. 

Construction of fragment C 

The third PCR fragment corresponding to bases 2023 to 3090 followed by 6 extra 
10 bases encoding an EcoRI site has been assembled starting from the 15 
oligonucleotides shown in Fig 31. This fragment has been inserted in the 
pCRIITOPO cloning vector and sequenced. This fragment has been retrieved by a 
Kpnl and EcoRI digestion (Fig 31). 

15 Construction of the entire coding sequence 

The entire FrsvHNp^ codon optimized coding sequence has been obtained by 
assembling fragment A, B, C as shown in Fig.32. pNIV4120 in which the PCR 
fragment A has replaced the original sequence (see Fig.29) was digested by BsrGI 
and EcoRI. The original sequence was eliminated and replaced by the BsrGI- Kpnl 
20 fragment B and the KpnI-EcoRl fragment C. The codon optimized module was 
retrieved from the PCRIITOPO vector by a Xbal and an EcoRI and inserted in the 
corresponding sites of the pEE14 vector. The resulting plasmid, pEERF,^ 
humHN Pi v 3 hum, encodes for the entire humanized coding sequence. The humanized 
FrsvHNp^ nucleic acids sequence is shown in Fig. 33. 

25 

Expression in CHO-KI cells 

The recombinant pEE14 Frsv humHN PiV3 (see construction of fragment A, above, or 

recombinant pEERF^ humHN P jv 3 hum see construction of the entire coding 

30 sequence, above) was transfected using the FuGene reagent (Boeringer Mannheim), 

using 5 fig DNA per 1.25 10 6 cells. The CHO-KI cells were grown in GMEM-S 

medium. The GS transfectants were selected by adding 25 /xM methionine 

sulfoximine to the culture medium two days after transfection. After ten to fourteen 
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days, resistant colonies were picked and transferred into 96 wells plates. Each 
transfoimant was then transferred into 24 wells plates and subsequently to 80 cm 2 
flasks. The GS transformants were assayed for the recombinant product when cells 
reached about 80% confluency. The procedure follows the one described in Cockett 
5 et al (Bio/Technology, 1990, 5, 662-667). Alternatively, the expression was 

evaluated three to five days after the addition of sodium butyrate (2mM) in the cell 
culture. 

To compare the expression level to that of the non humanized F^HNpiva, EUSA 
10 assays were done, using 20RG45, a goat anti-RSV serum (Fizgerald, U.S.A.) to 
coat the microtiter plates and mAbl9 anti-F RSV (G. Taylor, Inst, of Animal 
Health, Compton Lab, U.K.) as capture antibody. The expression level was 
estimated using a purified Fa-^v expressed in the Drosophila system. 

15 The level of expression of the non-humanized expressed product by 

pEEHFRsvHNpiva didn't exceed 0.03 mg/L and 0.1 mg/L when sodium butyrate 
was added to the culture medium. The level of expression of the partially 
humanized product expressed by pEEHFrsv humHNpj V3t reached 1 mg/L and up to 
3 mg/L when sodium butyrate was added in the culture medium. The humanization 

20 of the sequence coding for amino acids 1-423 of the 1029 amino acids thus 
enhanced the level of expression up to 30 fold (see Figure 34a). 

The level of expression of the entirely humanized product expressed by pEERF^ 
humHN PiV3 hum was at least of 2 mg/L and reached up to 50 mg/L when sodium 
25 butyrate was added in the culture medium. The humanization of the entire coding 
region of F RSV HNp i v 3 thus enhanced the level of expression of at least 200 to 500 
fold (see Figure 34b). 

ii) Optimization of the codon usage of the nucleic acids sequence coding for the 
30 mumps virus (MuV) fusion protein lacking the membrane anchor domain 
linked to the measles virus (MV) lacking the signal-anchor domain, F Min , (1-482) 
H Mv (59-617) for the expression in mammalian cells. 
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A table showing the comparison of the codon usage found in the FmovHmv module with 
the one found in highly expressed human gene can be found in Fig.35. As it can be 
seen, the codon usage frequencies of this chimerical gene is quite different from those 
5 prevalent in the human genome. For the improvement of the FmuvHmv protein 
expression, the entire coding sequence has been re-engineered to fit at best the human 
codon usage. The re-engineered sequence was obtained using synthetic long 
oligonucleotides, polymerase chain reaction (PCR) and conventional cloning 
procedures. 

10 

Re-engineering of the coding sequence of the F M uvHmv module 

The entire synthetic sequence was recovered by joining four PCR fragments 
(A, B, C and D). The general strategy to obtain each PCR fragment is schematically 
represented in Fig 36. It consists of assembling overlapping long oligonucleotides in a 
15 first round amplification. The resulting full size fragment is further amplified using two 
short primers located on each of its extremities. 

Construction of fragment A 

The first PCR fragment, corresponding to 13 bases specifying restriction sites and a 
20 Kozak consensus motif followed by bases 1 to 1026 of the F^v^ was obtained by 
PCR assembly of 12 overlapping oligonucleotides (Fig 37). This fragment has been 
inserted in the pCRIITOPO cloning vector (Invitrogen). After sequencing the 
fragment, it was retrieved from the pCRIITOPO vector by a Xbal and TspRI 
digestion and a 963 bp fragment was further purified, leading to fragment A. 

25 

Construction of fragment B 

The second PCR fragment B corresponding to bases 965 to 1712 of F^vHmv was 
obtained by assembling 9 oligonucleotides whose sequences can be found in Fig. 38. 
After its insertion into the pCRIITOPO vector and its sequencing, this 785 bp 
30 fragment has been recovered by a TspRI and Aval digestion. 
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Construction of fragment C 

The third PCR fragment C corresponding to bases 1712 to 2485 has been assembled 
starting from the 11 oligonucleotides shown in Fig 39. It has been inserted in the 
pCRIITOPO cloning vector and sequenced. This 774 bp fragment has been retrieved 
5 by an Aval and Apal digestion. 

Construction of fragment D 

The fourth PCR fragment D corresponding to bases 2485 to 3139 followed by 8 bp 
specifying a unique restriction site has been assembled starting from the 8 
10 oligonucleotides shown in Fig 40. This fragment has been inserted in the 
pCRIITOPO vector and sequenced. A 657 bp fragment has been recovered after an 
Apal and EcoRI digestion. 

Construction of the entire coding sequence 

15 The entire F MuV H Mv codon optimised coding sequence has been obtained by 
assembling fragment A, B, C, D and inserting the module digested by Xbal and 
EcoRI into the corresponding sites of the pEE14 vector (Fig. 41). The resulting 
plasmid, pEE 1 4F Mt) v humH Mv hum , encodes for a humanised sequence coding for aa 
1-482 of the mumps virus fusion protein followed by aa 59-617 of the measles 

20 virus. The humanised and original F^Hmv nucleic and amino acids sequences are 
shown in Fig. 42. 

iii) Purification and analysis of FHN expressed in CHO-KI 

25 a) Purification 

CHO cell line expressing secreted recombinant FHN was cultivated in cell factories in 
G-MEM medium supplemented with 2% FCS, in presence or absence of 1% Butyrate 
Na. FHN was purified by immunoaffinity chromatography by loading spent culture 
medium onto a Mabl9-sepharose column as described using the same experimental 

30 conditions. 
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When expressed in absence of Butyrate Na, purified FHN migrated on SDS-PAGE, in 
heating and reducing conditions, mainly as a band of 1 10 kDa. In contrast, FHN is 
visualized as a triplet of 1 10, 120 and 130 kDa when CHO cells are cultivated with 
butyrate. Heating has a more drastic effect than reducer on the FHN electrophoretic 
5 migration. Indeed, high molecular weight species are clearly detected in the 

preparation when electrophoresis proceeded without heating suggesting the presence 
of FHN aggregates or oligomers. These aggregates did not seem to be contaminated 
by CHO proteins. Antibodies directed to CHO proteins did not specifically recognize 
on Western blot any bands. Glycan analysis was performed using several lectins 
10 specific for different carbohydrate moieties. Surprisingly, FHN did not cany sialic 
acids or high-mannose structures but carbohydrates of galactose-acetyl-galactosamine 
type characteristic of hybrid N- and/or O-glycosylations. 

N-terminal microsequence analysis showed mainly the presence of Fl subunit in bands 
15 of 1 10-130kDa. The F2 N-terminal amino acid sequence detected in bands of lower 
and higher molecular weight indicated that some purified FHN molecules are present 
under a F0 form (non mature F). 

The presence of aggregates or oligomers in the FHN preparations was confirmed by 
20 gel filtration analysis and proteins were detected by laser-light scattering. Whatever the 
culture conditions (butyrate or not), between 50 and 65% of FHN populations 
displayed a molecular weight higher than 10 6 Da demonstrating that FHN is 
aggregated. 5 to 15% has a molecular weight ranging from 400 to 900 kDa whereas 
30 to 35% is monomeric FHN. 

25 

b) Serum immunoglobin analysis. 
Immunisation protocol 

The FrsvHNito protein was purified from the spent medium culture of the CHO-KI 
30 cells transfected by the recombinant pEE 1 4 FRsvhumHNpivshum by immunoaffinity 
chromatography as described (Purification of the recombinant product expressed in 
baculovirus recombinant infected SF9 cells). The product was injected in 7 groups of 
Balb CI mice as descibed in the following table 1 . 
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Humoral response directed against the FHN protein 

The humoral response directed against the FHN protein was determined. To this end, 
ELIS A plates were coated with immunoaflBnity purified FHN protein. 

5 

Total IgG (Fig 43) 

To detect specific anti-FHN total IgG, ELIS A plates were coated with 200ng of 
immunoaffinity purified FHN protein, plates were then saturated and dilutionsof the 
mice second bleed sera were then applied. Total IgG were detected using a biotinylated 
10 serum directed against mouse IgG. 



IgGl(Fig44) 

To detect specific anti-FHN IgGl, ELIS A plates were coated with lOOng of 
immunoaflBnity purified FHN protein, plates were then saturated and dilutionsof the 
15 mice second bleed sera were then applied. IgGl were detected using a biotinylated 
serum directed against mouse IgGl . 

IgG2a(Fig45) 

To detect specific anti-FHN IgG2a, ELIS A plates were coated with lOOng of 
20 immunoaflBnity purified FHN protein, plates were then saturated and dilutionsof the 
mice second bleed sera were then applied. IgG2a were detected using a biotinylated 
serum directed against mouse IgG2a. 

The titer of each sera was determined and a mean titer for each group was calculated 
25 and is reported in table 2. These experiments show that the FHN antigen by itself or 
formulated with adjuvant (group 1 to 3), stimulates a specific humoral response. 
Indeed, no anti-FHN antibodies are generated in the untreated mice group (group 5) or 
in the group immunised solely with the adjuvant (group 4). The group 1 (and group 4) 
adjuvant was 3D-MPL and QS21 formulated with cholesterol containing liposomes as 
30 described in WO 96/33739; the group 2 adjuvant was alum. 



The IgGl/IgG2a ratio indicates the Thl or Th2 orientation of the immune response; 
(Table2), a protective response against both the RS V or the PiV3 should tend toward 
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the Thl type, that is a low IgGl/IgG2a ratio. In this regard, the responses generated 
with the FHN formulated in the presence of the 3D-MPL + QS21 adjuvant appears to 
be the more promising one. 



5 Table 1: Experimental procedures 
Immunogenicity FHN in 
mice 



Group 


n 


Vol 
(ul) 


route 


Antigen 




Immuno- 


buffer 


preservative 










nature 


dose 
(MS) 


stimulants 






1 


12 


2x50 


IM 


FHN 


2 


3D-MPL/ 
QS21 


PBS mod 
pH7.4 


thiomersal low 
(lUg/ml) 


2 


12 


2x50 


IM 


FHN 


2 


Al(OH)3 


PBS mod 
pH7.4 


thiomersal low 
(lug/ml) 


3 


12 


2x50 


IM 


FHN 


2 


/ 


PBS mod 
pH7.4 


thiomersal low 
(lug/ml) 


4 


12 


2x50 


IM 


/ 


/ 


3D-MPL/ 
QS21 


PBS mod 
pH7.4 


thiomersal low 
(lUg/ml) 


5 


12 


/ 


/ 


untreated 


/ 


/ 


/ 


/ 


6 


12 


2x30 


INA 


RSV live 




/ 


/ 


/ 


7 


12 


2x30 


INA 


PIV-3 live 




/ 


/ 


/ 



LM=intra-muscular 
INA=intra-nasal 
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Antigen 


cc. ng/m) 


Buffer 


RSV live 


6.2 

logPFU/ml 




PIV-3 
live 


6.7 

logPFU/ml 




FHN 


120 (2.5ml) 


PBS 
pH7.3 



Time schedule: 
5 Injection 1 = Day 0 
Injection 2 = Day 28 
First Bleed = Day 28 
Second bleed = Day 42 
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Table 2: Serum antibody response against FHN. 

The total IgG, IgGl and IgG2a was determined for each mouse sera. A mean titer for 
each group was then calculated and is reported in the table. 



group 
n c 


Immunogen 


Total IgG 


IgGl 


IgG2a 


IgGl/IgG2a 


1 


FHN + 3D- 
MPL/QS21 


1182000 


109800 


305500 


0.36 


2 


FHN + Alum 


182200 


127100 


4429 


28.7 


3 


FHN 


44990 


22760 


1941 


11.73 


4 


adjuvant=from 
group 1 


49 


32 


ND 


ND 


5 


untreated 


52 


ND 


ND 


ND 


6 


Live RSV 


12840 


748 


2718 


0.27 


7 


LivePiV3 


10860 


2758 


2320 


1.19 



ND=undetermined, the titer being to low 
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Claims 

1. A process for preparing a heterochimeric protein or an immunogenic 
derivative thereof comprising an immunogenic fragment of the fusion (F) protein of 
RSV, PIV1, PIV2, PIV3, MV or MuV and an immunogenic fragment of the 

5 attachment (G, HN or H) protein of RSV, PIV1, PIV2, PIV3, MV or MuV which 
process comprises expressing recombinant DNA encoding the heterochimeric 
protein or immunogenic derivative thereof in CHO cells and recovering the protein. 

2. A process according to claim 1 wherein at least one non-preferred or less 

10 preferred codon in a natural gene or DNA encoding the said heterochimeric protein 
or immunogenic fragment thereof has been replaced by a preferred codon encoding 
the same amino acid. 

3. A heterochimeric protein or an immunogenic derivative thereof comprising an 
15 immunogenic fragment of the fusion (F) protein of RSV, PIV1, PIV2, PIV3, MV 

or MuV and an immunogenic fragment of the attachment (G, HN or H) protein of 
RSV, PIV1, PIV2, PIV3, MV or MuV, with the proviso that where one of the 
immunogenic fragments is derived from RSV F, RSV G or PIV3 F, PIV3 HN, the 
other of the immunogenic fragments is derived from MuV F, MuV HN, MV F, 
20 MV H, PIV1 F,PIV1 HN, PIV2 F or PIV2 HN. 

4. A process for preparing a heterochimeric protein or immunogenic derivative 
thereof as claimed in claim 3 which process comprises expressing recombinant 
DNA encoding the heterochimeric protein or immunogenic derivative thereof in 

25 either one of; CHO cells or insect cells and recovering the protein. 

5. A protein according to claim 3 wherein the immunogenic fragment of the F 
protein is lacking the membrane anchor domain at its C-terminal end. 

30 6. A protein according to claims 3 or 5 wherein the immunogenic fragment of the 
G, HN or H protein is lacking the signal/anchor domain at its N-terminal end. 
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7. A protein according to any one of claims 3, 5 or 6 which is linked via an amino 
acid in the C-terminal pan of the immunogenic fragment of the F protein of RSV, 
PIV1. PIV2, PIV3, MV or MuV to an amino acid in the N-terminal part of the 
immunogenic fragment of the G protein of RSV or the HN protein of PIV1, PIV2, 

5 PIV3, MuV or the H protein of MV. 

8. A protein according to any one of claims 3, 5, 6 or 7 which commences at its N- 
terminal end with a signal sequence from the F protein of RSV, PIV1, PIV2, PIV3, 
MV or MuV. 

10 

9. A protein according to any one of claims 3,5,6 or 7 which commences at its N- 
terminal end with a signal sequence from TP A. 

10. A protein according to any one of claims 3 or 5 to 8 which comprises a 
15 ubiquitin leader sequence. 

11. A protein according to any one of claims 3 or 5 to 9 which comprises a 
polyhistidine tail. 

20 12. A protein according to claim 10 or 11 which comprises a cleavage site for 
cleaving off the ubiquitin leader sequence and/or the polyhistidine tail. 

13. A heterochimeric protein according to any one of claims 3 or 5 to 1 1 which is 
selected from the group consisting of: 

25 FsVRSVxHNsa-MuV; 

Fs + a-PIV3xHNs*a"MuV; 
FsVMuV xGsaRSV; or 
FsVMuV x HNs*aPIV3, or 
an immunogenic derivative thereof. 

30 

14. A heterochimeric protein according to any one of claims 3 or 5 to 1 1 which is 
selected from the group consisting of: 
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FsVMuVxHsaMV; or 
FsVRSVxHNsaPIVl, or 
FsVRSVx HNs aPIV2, or 
an immunogenic derivative thereof. 

5 

15. A heterochimeric protein which is: 

FsV (1-526) RSV x HNsa' (70-572) PIV3, 
Fs + a" (1-492) PIV3 x Gs a' (69-298) RSV, 
FsV (1-526) RSV x HNsa" (70-572) PIV3 bis, 
10 FsV (1-526) RSV x HNsa (70-572) PIV3 ent his, or 

sTPA (1-21) UB (1-74) ent Fs a* (24-526) x HN sa'(70-572) PIV3, or 
an immunogenic derivative thereof. 

16. Recombinant DNA encoding a heterochimeric protein or an immunogenic 
15 derivative thereof according to any one of claims 3 or 5 to 15. 

17. Recombinant DNA according to claim 16 in which at least one non-preferred 
or less preferred codon in the DNA has been replaced by a preferred codon 
encoding the same amino acid. 

18. DNA which hybridises under conditions of high stringency with the DNA of 
20 claim 16 or 17. 

19. An expression vector comprising recombinant DNA according to claims 16 to 
18. 

20. A host transformed with DNA according to any one of claims 16 to 18 or with 
a vector according to claim 19. 

25 21. A host according to claim 20 which is a CHO cell. 

22. A host according to claim 21 which is an insect cell. 
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23. A vaccine composition comprising a protein according to any one of claims 3 
or 5 to 13 or an immunogenic derivative thereof in admixture with a 
pharmaceutically acceptable carrier. 

24. A vaccine composition according to claim 23 further comprising 3D 
5 Monophosphoryl lipid A and/or QS-21. 

25. A vaccine composition according to claims 23 or 24 wherein the carrier is an 
oil-in-water emulsion. 

26. A heterochimeric protein or an immunogenic derivative thereof according to 
any one of claims 3 or 5 to 15 for use in medicine. 

10 27. A process for the production of a heterochimeric protein according to any one 
of claims 3 or 5 to 15 which process comprises expressing recombinant DNA 
encoding said protein or immunogenic fragment thereof in a host cell and 
recovering the protein. 

28. A method of treating a human or animal susceptible to paramyxoviridae viral 
15 infections comprising administering an effective amount of a vaccine according to 

any one of claims 23 to 25. 

29. Use of a protein or an immunogenic derivative thereof according to any one of 
claims 3 or 5 to 15 in the manufacture of a medicament for use in the treatment of 
respiratory disorders. 
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Fig. 1 
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pNIV2820 

Hindlll BamHI 



t 

ATG 



Nsil 



BamHI EcoRI 







aa 409-526 
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STOP 



Fs + a- 



Substitution of the Nsil-EcoRI 
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generated Nsil-EcoRI DNA piece 
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A) Synthetic adaptators 

5' C ATG AAT GAT CAA GGC TTG AGC AA 3' 

TTA CTA GTT CCG AAC TCG TTA GTC [SEQ ID NO: 1] 

BspHI BbsI 



B)pNIV4102 
Hindffl BamHI 



BspHI BbsI 

1 1 



BamHI 



Frsv 1-526 



HN MuV 60-582 jL_pUC19 



ATG 



STOP 



C)pNIV4104 



BamHI 



BamHI 



1 



Frsv 1-526 



1 



HN MuV 60-582 



-pSFVl 



ATG 



STOP 
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A) Synthetic adaptators 

5' C ATG AAC AAT GAG TTT ATG GAA GTT ACA GAA AAG ATC CAA 
TTG TTA CTC AAA TAC CTT CAA TGT CTT TTC TAG GTT 

BspHI 



ATG GCA TCG GAT ATT AT 3' 
TAC CGT AGC CTA TAA TATA 

Asel 



[SEQ ID NO: 2] 



B)pNIV4109 
BamHI 



Asel BspHI 



si tispl 



BamHI 



HN PIV3 70-572 



STOP 



Frsv 1-526 i^ P UC19 



ATG 



C)pNIV4110 

BamHI 







f 


Frsv 1-526 











BamHI 



ATG 



HN m3 70-572 i_ p SFVl 



STOP 
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Fig. 6 

A) Synthetic adaptators 

5' GAT CTA GAA GAG TCA AAA GAA TGG ATA AGA AGG TCA AAT CAA 
AT CTT CTC AGT TTT CTT ACC TAT TCT TCC AGT TTA GTT 

Bglll 

AAA CTA GAT TCC ATT GGA AAT TGG CAT CAA TCT AGC ACC 3' 
TTT GAT CTA TGG TAA CCT TTA ACC GTA GTT AGT TCG TGG CAGT G 

Maelll 



[SEQIDNO:3] 



B)pNIV4103 

BamHI 



1 






r 


F PIV3 1-492 





Bgffl MaeHI 



Grsv 69-298 



Hindlll 

-pUC19 



ATG 



STOP 



C)pNTV4106 

Smal/BamHI 



HindllllSrasl 



F PIV3 1-492 



Grsv 69-298 ±_pSFVl 



1 



ATG 



STOP 
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Fig. 7 

A) Synthetic adaptators 

5'G ATC TAG AAG AGT CAA AAG AAT GGA TAA GAA GGT CAA ATC 
ATC TTC TCA GTT TTC TTA CCT ATT CTT CCA GTT TAG 

Bglll 

AAA AAC TAG ATT CCA TTG GAA ATT GGC ATC AAT CTA GCA CCA 
TTT TTG ATC TAA GGT AAC CTT TAA CCG TAG TTA GAT CGT GGT 



CAA ATG ATC AAG GCT TGA GCA A 3' 
GTT TAC TAG TTC CGA ACT CGT TAGTC 

Bbsl 



[SEQ ID NO: 4] 



B)pNIV4117 
BamHI 



BglUBbsI 





4 


r 


F m3 1-493 







HN MuV 60-582 



BamHI 



±_pUC19 



ATG 



STOP 



C)pNIV4118 
BamHI 









r 


Fp,v3 1-493 





HN MuV 60-582 



BamHI 
_ l_pSFVl 



ATG 



STOP 
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A) Synthetic adaptators 

5' GAA TGC CGT TAA ATA CAT CAA GAG AGT AAC CAT CAA 

A CGT CTT ACG GCA ATT TAT GTA GTT CTC TCA TTG GTA GTT 
PstI 

CTC CAT CGG TCT CAG TAA GTT CTA AA 3' 

GAG GTA GCC AGA GTC ATT CAA GAT TTC AGT [SEQ ID NO 

Maem 



B)pNTV4113 
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A) Synthetic adaptators 

5' GTAAGTTCTAAA 3' 

CAAGATTTTTAA 
Bsal EcoRI 



[SEQ ID NO: 6] 



B)pNIV4115 
BamHI 
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i 1 



BamHI 
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A) Synthetic adaptators 

5' C ATG AAC AAT GAG TTT ATG GAA GTT ACA GAA AAG ATC CAA 
TTG TTA CTC AAA TAC CTT CAA TGT CTT TTC TAG GTT 

BspHI 



ATG GCA TCG GAT ATT AT 3' 
TAC CGT AGC CTA TAA TAT A 

Asel 



[SEQ ID NO: 7] 
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BamHI Spel 

1 



BspHI Asel 



BamHI 



Frsv 1-526 



■ 



HN WV3 70-572bis 



-pUC19 



ATG 

C) pEE14 F s + a" RSV xHN s a PiV3bis 



STOP 



BclI/BamHl 



BamHI /Bell 



1 






r 


Frsv 1-526 





HNp, V3 70-572bis 



^_pEE14 



ATG 



STOP 
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Fig. 19 

A) Synthetic adaptators 

PstI 5'GT GAC GAT GAC GAT AAG CAT CAT CAT CAT CAT CAT TAG 
ACGTC ACA CTG . CTA CTG CTA TTC GTA GTA GTA GTA GTA GTA ATC 



GGATCCGCATG 3" 

CCTAGGC Sphl [SEQIDNO:8] 



B) pNIV3340 



Xhol PstI Sphl BamHI 

I 



HN PIV3 1-572 



t 

ATG 

C)pNIV4120 

BamHI 



Aspx4Lys Hisx6 



STOP 

Xhol BamHI 



1 



Frsv 1-526 



HN PIV3 70-572 



ATG 

C) pEE 1 4 F s + a' RSV xHN s a' PiV3 enthis 

BamHI Xhol 



STOP 



pIBI 



pUC19 



1 



i 



BamHI 



F^v l" 526 



HN P , V3 70r5^ 



Aspx4Lys isx6 ^ 



pEE14 



ATG 



STOP 
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Fig. 20 



A) pNIV3475 

EcoRI Xbal 



Saccharomyces cerevisiae Ubiquitin 



Afl^I BamHI 

▼ 

Z_pUC19 



ATG 



Gly 76 



B) Synthetic adaptators 



5'CT AGC ATG CAG ATC TTC GTC AAG ACG TTA ACC GGT AAA ACC 
Nhel G TAC GTC TAG AAG CAG TTC TGC AAT TGG CCA TTT TGG 



ATA ACC 3" Xbal 
TAT TGG ATCT 



[SEQ ID NO: 9] 



C)pNIV4122 



HindHI 



Nhe: 



Xbal 



aa 1-21 sTPA 



A-S 



Aflll BamHI 



aa 1-76 Ubiquitin 



-pBluescript 



ATG 



ATG 



I 

Gly76 
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A)pNIV4122 
Hindlll 



aa 1-21 sTPA 



Ala-Ser 



ATG 



21/73 



aa 1-76 Ubiquitin 



Aflll BamHI Spel 

pBluesciipt 



Gly76 



B) Synthetic adaptators 

5'TTA AGA CTA AGA GAC GAT GAC GAT AAG TCC AGT CAA AAC 
Aflll CT GAT TCT CTG CTA CTG CTA TTC AGG TCA GTT TTG 

ATC ACT GAA GAA TTT TAT CAA TCA ACA TGC AGT GCA GTC AGC 
TAG TGA CTT CTT AAA ATA GTT AGT TGT ACG TCA CGT CAG TCG 

AAA GGC TAT CTT AGT GCT CTA AGA ACT GGT TGG TAT A3' Spel 
TTT CCG ATA GTT TCT CGA GAT TCT TGA CCA ACC ATA TGA TC 



[SEQ ID NO: 10] 
C)pNIV4123 

Hindlll Aflll 



Spel 

i 





sTPA 


a-s 


aa 1-74 
Ubiquitin 


cnieroK 


aa 24-55 F^y 




A 







-pBluesciipt 



ATG 
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Fig. 22 

A)pNIV4123 
Hindlll 



Spel 



1 








f 


sTPA 


a i aa 1-74 
s lUbiquitin 


entero 
K 


aa 24-55 

Frsv 






\ t r 





.pBluescript 



ATG ATG ATG 



B)pNIV4120 

Xbal Spel 



EcoRI 



Frsv 1-526 



HN PIV3 70-572 



ATG 



-pUC19 



STOP 



C)pNIV4124 
Xbal 



Spel 

1 



EcoRI 



sTPA 



a 


aa 1-74 


entero 


aa 24-55 


s 


Ubiquitin 


K 













Frsv 56-526 



HN PIV3 70-572 



ATG 



1 



-pUC19 



D) pEE14 sTPA UBI EN Fs'a" RSV x HN s a" PiV3 



Xbal 



Spel 

i 



Stop 



EcoRI 



sTPA 


a 


aa 1-74 


entero 


aa 24-55 




s 


Ubiquitin 


K 


Frsv 











Frsv 56-526 



HN PIV3 70-572 



Z_pEE14 



t 



ATG 



t 

Stop 
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Fig. 23 



A)pNIV4120 
BamHI 



BamHI 







Frsv 1-526 


■ 


HN PIV3 70-572 











j)UC19 



ATG 



STOP 



B)pNIV4132 



BamHI 



Polyhedrin 
promoter 



Frsv 1-526 



BamHI 



HNp IV3 70-572 



P AcUW51 
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Fig. 24 
A)pNIV4120 
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Spel 



BamHI 



Frsv 1-526 



HN P]V3 70-572 



_pUC19 



ATG 



STOP 



B) Synthetic adaptators 

5 'GAT CAA AAC ATC ACT GAA GAA TTT TAT CAA TCA ACA TGC 
BamHI TT TTG TAG TGA CTT CTT AAA ATA GTT AGT TGT ACG 

AGT GCA GTC AGC AAA GGC TAT CTT AGT GCT CTA AGA ACT 
TCA CGT CAG TCG TTT CCG ATA GAA TCA CGA GAT TCT TGA 

GGT TGG TAT A 3 'Spel 

CCA ACC ATA TGA TC [SEQ ID NO: 11] 

C) pNIV4136 



BamHI Spel 









Polyhedrin 
promoter 




gp67 
signal 


F^ 25-526 



BamHI 



HN PIV3 70-572 



pAcUW51 
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Fig 25: SDS-PAGE (reduced conditions) of the Fj^vHNp^ protein purified by 
immunoaffinity from the spent culture medium of the recombinant baculovirus 3546. 

kDa: molecular weight marker 

A: Coomassie blue staining 

B: Western blot revealed by a goat polyclonal anti-RSV serum 20 RG45 



kDa A B 




SUBSTITUTE SHEET (RULE 26) 
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Fig . 26: Codon usage of Fr^HNp^ and highly expressed human genes (hum high exp) 
showing frequencies (xlOO) of the individual codons for each of the degenerately encoded 
amino acids, and the most prevalent codon in bold. 
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Fig. 27: Schematic diagram of the PCR synthesis of each fragment showing unique 
restriction sites along the sequence (black dots) and restriction sites (A and B) that allow 
retrieval of the full size fragment from the cloning vector. 



B 



PCR 



PCR 



ligation and insertion into the pcrllTOPO 
cloning vector. 



Sequencing 



Retrieval from the pcrllTOPO vector by restriction with 
enzyme A and B. 
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Fig . 28: Sequence of the 18 oligonucleotides from which PCR fragment A was generated. 

1) olfhuml.seq, bases 1-90 of F R svHNp iV 3/ homologous to mRNA 

5' cccTCTAGAGGATCCACCATGGAGCTGCTGATtttaAAGACCAACGCCATCACCGCCATCCTG 
GCCGCGGTGACCCTCTGCTTCGCGTCC 

2) olf hum2 . seq, bases 75-165 of FRsyHNpivs, inverse complementary to 
mARN 

5' CCTCAGCGCGCTCAGGTAGCCCTTGCTGACAGCagaGCAGGTGGACTGGTAGAACTCCTCGGTG 
ATGTTCTGGCTGGACGCGAAGCAGAGG 

3) olfhum3.seq, bases 150-240 of F RS vHN Pi v3, homologous to mRNA 

5' CCTGAGCGCGCTGAGGACGGGGTGGTACACtAGtGTGATCACCATCGAGCTGAGCAACATCAAG 
GAGAACAAGTGCAACGGCACCGACGCC 

4) olfhum4.seq / bases 225-310 of F RS vHNpiv3, inverse complementary to 
mARN 

5' GCATCAGCAGCTGCAGCTCGGTCACGGCGCTCTTGTACTTGTCCAGCTCCTGCTTGATCAGCTT 
CACCTTGGCGTCGGTGCCGTTG 

5) olfhumS.seq, bases 295-397 of F RSV HNpiv3, homologous to mRNA 
5'CTGCAGCTGCTGATGCAGAGCACCCCCGCCACCAACAACagaGCCAGGCGCGAGCTGCCCAGGT 
TCATGAACTACACCCTCAACAACACCAAGAACACCAACG 

6) olfhum6.seq, bases 378-496 of F RS vHN PiV 3, inverse complementary 
to mRNA 

GGTGCAGGACCTTGGACACCGCGATGCCGCTGGCGATGGCGGAGCCCACGCCCAGCAGGAAGCCCA 
GGAAgCGcctCTTgCgCTTCTTGCTCAGGGTCACGTTGGTGTTCTTGGTGTTG 

7) olfhum7.seq, bases 480-561 of F RS vHNp iV 3, homologous to mRNA 

5 ' GTCCAAGGTCCTGCACCTGGAGGGGGAGGTGAACAAGATCAAGAGCGCCCTGCTCTCCACCAAC 
AAGGCGGTGGTCAGCCTG 

8) olfhum8.seq, bases 543-633 of F RS vHNp iV 3f inverse complementary 
to mARN 

5'GGGGAGCAatTGCTTGTCGATGTAGTTCTTGAGGTCCAGCACCTTGCTGGTCAGCACGCTCACG 
CCGTTGGACAGGCTGACCACCGCCTTG 

9) olfhum9.seq, bases 609-676 of FrsvHN?^, homologous to mRNA 

5 ' CTACATCGACAAGCAa t TGCTCCCCATCGTGAACAAGCAGt cCTGCAGCATCTCTAACATTGAG 
ACCG 

10) olfhumlO.seq, bases 653-732 of FrsvHNp^, inverse complementary 
to mARN 

5' GCTGAACTCCCTGGTGATCTCCAGCAGCCTGTTGTTCTTCTGCTGGAACTCGATCACGGTCTCA 
ATGTTAGAGATGCTGC 
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.11) olfhumll . seq, bases 714-787 of FRsvHN P iv3, homologous to mRNA 
5'GATCACCAGGGAGTTCAGCGTGAACGCgGGcGTcACCACCCCGGTGAGCACCTACATGCTGACC 
AACAGCGAGC 

12) olf huml2 . seq, bases 768-84 6 of FRsvHNpi V 3, inverse complementary 
to mARN 

5' GTTGGACATaAGCTTCTTCTGGTCGTTGGTGATGGGCATGTCGTTGATCAGGGACAGCAGCTCG 
CTGTTGGTCAGCATG 

13) olfhuml3 . seq, bases 825-916 of FRsvHNpi V 3/ homologous to mRNA 
5'CCAGAAGAAGCTtATGTCCAACAACGTGCAGATCGTGCGCCAGCAGAGCTACagCATCATGagC 
ATCATCAAGGAGGAGGTGCTGGCCTACG 

14) olfhuml4 . seq, bases 900-990 of FRsvHNpiva, inverse complementary 
to mARN 

5' GGTGGTGCACAGGGGGGAGGTGTGCAGCTTCCAGCAGGGGGTGTCGATCACGCCGTACAGGGGC 
AGCTGCACCACGTAGGCCAGCACCTCC 

15) olfhumlS.seq, bases 975-1065 of FrsvHNpivb, homologous to mRNA 

5' CCCCCTGTGCACCACCAACACCAAGGAGGGCTCCAACATCTGCCTGACCCGCACCGACCGGGGC 
TGGTACTGCGACAACGCCGGCTCCGTG 

16) olfhuml6.seq, bases 1048-1133 of FrsvHN?^/ inverse 
complementary to mARN 

5'CTGTTCATGGTGTCGCAGAACACGCGGTTGGACTGCACCTTGCAGGTCTCCGCCAGGGGGAAGA 
AGGACACGGAGCCGGCGTTGTC 

17) olfhuml7 . seq, bases 1116-1210 of FasvHNpiva, homologous to mRNA 
5'CTGCGACACCATGAACAGCCTGACCCTGCCCAGCGAGGTGAACCTCTGCAACATCGACATCTTC 
AACCCCAAGTACGACTGCAAGATtATGacctcc 

18) olfhuml8 . seq, bases 1195-1295 of F RS vHNpi V 3/ inverse 
complementary to mARN 

gggaattctgtacacttggtcttgccgtagcaggacacgatggcgcccagggaggtgatcacggag 
ctgctcacgtcggtcttggaggtCATaATCTTGCAG 



[SEQUENCES ABOVE ARE SEQ ID NOs: 12 to 29, respectively] 
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Fig. 29: Construction of pEE14 FRsvhumHNpjyj 



a) PCR fragment A 



Xbal 



BsrGI 



Folium 1-1264 



ATG 



b) pNIV4120 +PCR fragment A 

BsrGI 

Xbal 



F RSV hum 1-1264 



EcoRI 



FRsvHeS-lSTgXHNpj^ pUC19 



c) pEE14 F RSV hum HN P 



BsrGI 



Xbal 



F P c V hum 1-1264 



EcoRI 



F R c V 1265-1578XHN 



PiV3 



-pEE14 
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Fig. 30: Sequence of the 10 oligonucleotides from which PCR fragment B was generated. 

1) olfhnhuml9. seq, bases 1269-1353 of FRsvHNp iV 3, homologous to mRNA 
5 ' cggcaagaccaagtgtacagcctccaacaagaaccgcggcatcatcaagaccttctccaacgg 
ctgcgactacgtgtccaacaag 3' 

2) olfhnhum20. seq, bases 1336-1428 of FRsvHN P iv3, inverse 
complementary of mARN 

5' cttcacgtacaggctcttgccctcctgcttgttcacgtagtacagggtgttgcccacggacac 
ggtgtccacgcccttgttggacacgtagtc 3 1 

3) olfhnhum21. seq, bases 1413-1497 of FRsvHN P iv3, homologous, to mARN 
5 1 gagcctgtacgtgaagggcgagcccatcatcaacttctacgacccgctggtgttcccctccga 
cgagttcgacgcctccatctccc 3 1 

4) olfhnhum22. seq, basesl483-1599 of FrsvHNp^, inverse 
complementary of mARN 

5 1 gttcatgatgttggtggtggacttgccggcgttcacgttgtgcagcagctcgtcggacttgcg 
gatgaaggccaggctctggttgatcttctcgttcacctgggagatggaggcgtc 3 1 

5) olf hnhum23. seq, bases 1581-1691 of FrsvHNpivs, homologous to mARN 
5 1 caccaccaacatcatgaacaacgagttcatggaggtgaccgagaagatccagatggcctccga 
caacatcaacgacctgatccagtccggcgtgaacacccggctgctgac 3 1 

6) olf hnhum24 , seq, bases 1677-1779 of F R svHN Pi v3, inverse 
complementary of mARN 

5 ! gatggtgatctcgctgatgaacttccgcaggtcggacatctgctgggtcagggagatggggat 
gtagttctgcacgtggctctggatggtcagcagccgggtg 3 ' 

7) olfhnhum25. seq, bases 1761-1865 of F RS vHNpiv3, homologous to mRNA 
5 1 catcagcgagat caeca tccggaacgacaaccaggaggtgcccccccagaggatcacccacga 
cgtgggcataaagcccctgaaccccgacgacttctggcgctg 3 f 

8) olf hnhum26. seq, bases 1849-1967 of F RS vHNpiv3, inverse 
complementary of mARN 

5 ' gtgcgcacgcagccgtccacggtggtgggcatggccagcaggccgggcccgggcatcagcctt 
atcttgggggtcttcatcagggaggggaggccggaggtgcagcgccagaagtcgtc 3 ' 

9) olfhnhum27.seq, bases 1953-2059 of FRsvHNpi V 3/ homologous to mRNA 
5 1 cggctgcgtgcgcaccccctccctggtgatcaacgacctgatctacgcctacacctccaacct 
gatcacccgcggctgccaggacatcggcaagtcctaccaggtgc 3 ' 

10) olfhnhum28.seq, bases 2043-2154 of FrsvHNpivs, , inverse 
complementary of mARN 

5 • ggacttcctgttgtcgttgatgttgaaggtgtgggagatccgggggttcaggtcgggcaccag 
gtcggagttcacggtgatgatgccgatctgcagcacctggtaggacttg 



[SEQUENCES ABOVE ARE SEQ ID NOs : 30-39, respectively] 
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Fig. 31: Sequence of the 16 oligonucleotides from which PCR fragment C was generated. 

1) olfhnum29.seq, bases 2139-2229 of FRsvHN P iv3, homologous to mRNA 

5 1 cgacaacaggaagtcctgctccctggccctcctgaacaccgacgtgtaccagctgtgctccac 
gcccaaggtggacgagcgctccgactac 3 ' 

2) olf hnhum30 . seq, bases 2214-2307 of FRsvHNpiva/ inverse 
complementary to mRNA 

5' gttcttgaagcgggtggtggagatggagccgtcgtggttgacgatgtccagcacgatgtcctc 
gatgccggagctggcgtagtcggagcgctcg 3 1 

3) olfhnhum31. seq, bases 2292-2398 of FRsvHN P iv3/ homologous to mRNA 
5' cacccgcttcaagaacaacaacatcagcttcgaccagccctacgccgccctgtacccctccgt 
g'ggccccggcatctactacaagggcaagatcatcttcctgggc 3 ' 

4) olfhnhum32.seq / bases 2382-2472 of F RS vHN P iv3, inverse 
complementary to mRNA 

5' ccgctgggtcttgccggggcacccggtggtgttgcagatggcgttctcgttgatggggtgctc 
caggccgccgtagcccaggaagatgatc 3' 

5) olfhnhum33 . seq, bases 2457-2549 of FRs V HNpi V 3, homologous to mRNA 
5 1 cggcaagacccagcgggactgcaaccaggcctcccacagcccctggttctccgaccgccgcat 
ggtgaactccatcatcgtggtggacaaggg 3 1 

6) olf hnhum34 . seq, bases 2532-2 64 3 of F RSV HNpiv3/ inverse 
complementary to mRNA 

5' cttgttgcccagcagcagcaggcggccctcggagccccagtagttctgccgcatggagatggt 
ccacaccttcagcttggggatggagttcaggcccttgtccaccacgatg 3 1 

7) olfhnhum35. seq, bases 2628-2726 of FRsvHN Pi v3, homologous to mRNA 
5 ' gctgctgggcaacaagatctacatctacacccgctccaccagctggcacagcaagctgcagct 
gggcatcatcgacatcaccgactacagcgacatccg 3 ' 

8 ) olf hnhum36. seq, bases 2710-2781 of F RSV HN P iv3, inverse 
complementary to mRNA 

5' ggggcactcgttgttgccgggccggctcagcacgttgtgccaggtccacttgatgcggatgtc 
gctgtagtc 3 ' 

9) olf hnhum37 . seq, bases 2765-2836 of F RS vHNpi V 3, homologous to mRNA 
5' gcaacaacgagtgcccctggggccactcctgccccgacggctgcatcaccggcgtgtacaccg 
acgcctacc 3 1 

10) olfhnhum38.seq, bases 2820-2889 of F RS vHN Piv3 , inverse 
complementary to mRNA 

5 1 cttctgggagtccaggatcacggagctcacgatgctgccggtggggttcagggggtaggcgtc 
ggtgtac 
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11 ) olf hnhum39 . seq, bases 2874-2943 of FRsvHNpi V 3, homologous to mRNA 
5 ' cctggactcccagaagtcccgggtgaaccccgtgatcacctacagcacctccaccgagcgcgt 
gaacgag 

12) olfhnhum40.seq, bases 2927-2994 of FrsvHN pi V3 f r om : 1 to: 68, 
inverse complementary to mRNA 

5' gcagctggtggtggtgtagccggcgctcagggtcttgttgcggatggccagctcgttcacgcg 
ctcgg 

13) olfhnhum41 .seq, bases 2979-3043 of FRsvHNpi V 3/ homologous to mRNA 
5' caccaccaccagctgcatcacccactacaacaagggctactgcttccacatcgtggagatcaa 
cc 

14) olf hnhum42 . seq, bases 3027-3085 of FRsvHNpiva, inverse 
complementary to mRNA 

5 ' cggtcttgaacagcatgggctggaaggtgtccaggctcttgtggttgatctccacgatg 3 1 

15) olfhnhum43 . seq, bases 3069-3114 of FrsvHNp^, homologous to mRNA 
5' catgctgttcaagaccgagatccccaagagctgcagctaaGAATTC 3 1 



[SEQUENCES ABOVE ARE SEQ ID NOs : 4 0-54, respectively] 
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Fig. 32 : Construction of pEEHF^hum HN p^hum 

a) pNIV4120 +PCR fragment A 

BsrGI 

Xbal 



EcoRI 



FRgyhum 1-1264 



1 RSV 

1265-1578XHN ttV J_p UC 19 



b) PCR fragment B 



BsrGI 



Kpnl 



F RSV HNp iV il264-2136hum 



c) PCR fragment C 



Kpnl 



EcoRI 



Frsv HNpiV32136- 
3O90hum 



d) pEE14 F RSV hum HNp^hum 



Xbal 



EcoRI 



FasvhumHNpjvjhum 



pEE14 
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Fig . 33A : Humanized nucleic acids sequence of F RSV HN nV 3 (upper sequence) compared to 
the original sequence found in the pNTV4120. 

7 AGAGGATCC ACCATGGAGCTGCTGATtttaAAGACCAACGCCA 4 9 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

22 62 AGAGGATCCCCCGGGTAccatggagttgctaatcctcaaaacaaatgcaa 2311 

50 TCACCGCCATCCTGGCCGCGGTGACCCTCTGCTTCGCGTCCAGCCAGAAC 99 

' i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii n 1 1 1 1 1 1 1 1 

2312 ttaccgcaatccttgctgcagtcacactctgttttgcttccagtcaaaac 2361 

100 ATCACCGAGGAGTTCTACCAGTCCACCTGCtctGCTGTCAGCAAGGGCTA 149 
Mill II II II II II II II III i I I I I ! I I I I I I I I I I 
2362 atcactgaagaattttatcaatcaacatgcagtgcagtcagcaaaggcta 2411 

150 CCTGAGCGCGCTGAGGACGGGGTGGTACACtAGtGTGATCACCATCGAGC 199 
II II II II II II II I I I I I I I I I I I I I II II II II 
2412 tcttagtgctctaagaactggttggtatactagtgttataactatagaat 24 61 

200 TGAGCAACATCAAGGAGAACAAGTGCAACGGCACCGACGCCAAGGTGAAG 24 9 
I II II I I I I I I II II I I II I II II II I I I I I I I II I II 
24 62 taagtaatatcaaggaaaataagtgtaatggaacagacgctaaggtaaaa 2511 

250 CTGATCAAGCAGGAGCTGGACAAGTACAAGAGCGCCGTGACCGAGCTGCA 2 99 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
2512 ttgataaaacaagaattagataaatataaaagtgctgtaacagaattgca 2561 

300 GCTGCTGATGCAGAGCACCCCCGCCACCAACAACagaGCCAGGCGCGAGC 34 9 
I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2562 gttgctcatgcaaagcacaccggcaaccaacaatcgagccagaagagaac 2611 

350 TGCCCAGGTTCATGAACTACACCCTCAACAACACCAAGAACACCAACGTG 399 

I i I Mill Mill II II I II II II I I I II I II I I I I I II 

2612 taccaaggtttatgaattatacactcaacaataccaaaaataccaatgta 2661 

4 00 ACCCTGAGCAAGAAGcGcAAGaggCGcTTCCTGGGCTTCCTGCTGGGCGT 44 9 

II I II I M I II I I I I I I II I I I I I I I I I I I I I I 
2662 acattaagcaagaaaaggaaaagaagatttcttggctttttgttaggtgt 2711 

• • ■ * • 

4 50 GGGCTCCGCCATCGCCAGCGGCATCGCGGTGTCCAAGGTCCTGCACCTGG 4 99 

II II M I I I I I I M I I I I I II II II I I M I II I I I II I I I 
2712 tggatctgcaatcgccagtggcattgctgtatctaaggtcctgcacctag 2761 

500 AGGGGGAGGTGAACAAGATCAAGAGCGCCCTGCTCTCCACCAACAAGGCG 549 
I II II I I M I II I I Mill II II II II I I I I I II I I I I II 
27 62 aaggggaagtgaacaaaatcaaaagtgctctactatccacaaacaaggct 2811 
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550 GTGGTCAGCCTGTCCAACGGCGTGAGCGTGCTGACCAGCAAGGTGCTGGA 599 
II I I I I I I I II II II II I I II I I I I I I I I I III I II 
2812 gtagtcagcttatcaaatggagttagtgtcttaaccagcaaagtgttaga 2861 
. • • • • 

600 CCTCAAGAACTACATCGACAAGCAatTGCTCCCCATCGTGAACAAGCAGt 64 9 

I II I I I I I I I I II II II II Ml I II II I I I I I I I I I I I 
2862 cctcaaaaactatatagataaacagttgttacctattgtgaacaagcaaa 2911 

650 cCTGCAGCATCTCTAACATTGAGACCGTGATCGAGTTCCAGCAGAAGAAC 699 
III I I I II II I I I I I I I I II I I I I I I I I I I I I I II I I I I I I 
2912 gctgtagcatatcaaacattgaaactgtgatagagttccaacaaaagaac 2961 
• • • ■ • 

700 AACAGGCTGCTGGAGATCACCAGGGAGTTCAGCGTGAACGCgGGcGTcAC 749 
I I I I I II II I II I I I I II I II I II II II II II II II II 
2962 aacagactactagagattaccagggaatttagtgttaatgcaggtgtaac 3011 

7 50 CACCCCGGTGAGCACCTACATGCTGACCAACAGCGAGCTGCTGTCCCTGA 7 99 
II II I I I I I I I I I I I I I I I I I I I I I I III II 
3012 tacacctgtaagcacttatatgttaacaaatagtgaattattatcattaa 3061 

800 TCAACGACATGCCCATCACCAACGACCAGAAGAAGCTtATGTCCAACAAC 84 9 
I I I I II I I I I I II II II II I I I I I III I I I I I I I I I I I I 
3062 tcaatgatatgcctataacaaatgatcagaaaaagttaatgtccaacaat 3111 

850 GTGCAGATCGTGCGCCAGCAGAGCTACagCATCATGagCATCATCAAGGA 8 99 
I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
3112 gttcaaatagttagacagcaaagttactctatcatgtccataataaagga 3161 



900 GGAGGTGCTGGCCTACGTGGTGCAGCTGCCCCTGTACGGCGTGATCGACA 94 9 

I I I I I Mill II II II I I I I I I I I I I I II I I I 
3162 ggaagtcttagcatatgtagtacaattaccactatatggtgtaatagata 3211 

950 CCCCCTGCTGGAAGCTGCACACCTCCCCCCTGTGCACCACCAACACCAAG 999 

I II II I I I I I I I I I I I I I I I I I I II II II I I I I I I I I III 
3212 caccttgttggaaactgcacacatcccctctatgtacaaccaacacaaag 3261 

1000 GAGGGCTCCAACATCTGCCTGACCCGCACCGACCGGGGCTGGTACTGCGA 104 9 

II II I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I II 
3262 gaagggtccaacatctgtttaacaagaaccgacagaggatggtactgtga 3311 

1050 CAACGCCGGCTCCGTGTCCTTCTTCCCCCTGGCGGAGACCTGCAAGGTGC 1099 

III II II II II II I I I I I I I I II II II II II II II I 
3312 caatgcaggatcagtatctttcttcccactagctgaaacatgtaaagttc 3361 

• • • • • 

1100 AGTCCAACCGCGTGTTCTGCGACACCATGAACAGCCTGACCCTGCCCAGC 1149 
I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I 
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3362 aatcgaatcgagtattttgtgacacaatgaacagtttaacattaccaagt 3411 

1150 GAGGTGAACCTCTGCAACATCGACATCTTCAACCCCAAGTACGACTGCAA 1199 

II II II I II I I I I I I I I II I I I I I I I I I I I I I I II II I I I I I 
3412 gaagtaaatctctgcaacattgacatattcaaccccaaatatgattgcaa 34 61 

• > ■ a • 

1200 GATtATGacctccaagaccgacgtgagcagctccgtgatcacctccctgg 124 9 

I I I I I I I I II II II II II I I I I I I I II I I I II I I II II I 
34 62 aattatgacttcaaaaacagatgtaagcagctccgttatcacatctctag 3511 

1250 gcgccatcgtgtcctgctacggcaagaccaagtgtacagcctccaacaag 1299 

I I I I I I I I I I I I I I I I I I I I I II II I II I I I I I I I I I I II 

3512 gagccattgtgtcatgctatggcaaaactaaatgtacagcatccaataaa 3561 

1300 aaccgcggcatcatcaagaccttctccaacggctgcgactacgtgtccaa 134 9 

II II I I I I I I I I I I I I II II I I I I I II II II II II II 
3562 aatcgtggaatcataaagacattttctaacgggtgtgattatgtatcaaa 3611 

1350 caagggcgtggacaccgtgtccgtgggcaacaccctgtactacgtgaaca 1399 

I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
3612 taagggggtggacactgtgtctgtaggtaatacattatattatgtaaata 3661 

1400 agcaggagggcaagagcctgtacgtgaagggcgagcccatcatcaacttc 144 9 

I I I I II I I I II II II II II II II II II II II II III 
3662 agcaagaaggcaaaagtctctatgtaaaaggtgaaccaataataaatttc 3711 

14 50 tacgacccgctggtgttcccctccgacgagttcgacgcctccatctccca 14 99 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
3712 tatgacccattagtgttcccctctgatgaatttgatgcatcaatatctca 3761 

1500 ggtgaacgagaagatcaaccagagcctggccttcatccgcaagtccgacg 154 9 

II II I I I I II I I I I I I I I I I I I I II II II II II I I I I I I 

37 62 agtcaatgagaagattaaccagagcctagcatttattcgtaaatccgatg 3811 

1550 agctgctgcacaacgtgaacgccggcaagtccaccaccaacatcatgaac 1599 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3812 aattattacataatgtaaatgctggtaaatccaccacaaatatcatgAAC 3861 

1600 aacgagttcatggaggtgaccgagaagatccagatggcctccgacaacat 164 9 

II I I I I I I I I I I II II II I I I I I I I I I I I I I II II II II 

38 62 AATGAGTTTATGGAAGTTACAGAAAAGATCCAAATGGCATCGGATAATAT 3911 

1650 caacgacctgatccagtccggcgtgaacacccggctgctgaccatccaga 1699 

I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I II I 
3912 TAATGATCTAATACAGTCAGGAGTGAATACAAGGCTTCTTACAATTCAGA 3961 
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1700 gccacgtgcagaactacatccccatctccctgacccagcagatgtccgac 1749 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

3962 GTCATGTCCAGAATTATATACCaATATCATTGACACAACAAATGTCGGAT 4011 

> • • • • 

1750 ctgcggaagttcatcagcgagatcaccatccggaacgacaaccaggaggt 17 99 

II II I I I I.I I I I I i I I I I I I I MM II II II I! II 

4 012 CTTAGGAAATTCATTAGTGAAATTACAATTAGGAATGATAATCAAGAAGT 4061 

1800 gcccccccagaggatcacccacgacgtgggcataaagcccctgaaccccg 1849 

III II II II II II II II I I I I I I I I I I I II I II II I 

4 0 62 GCCTCCACAAAGAATAACACATGATGTGGGCATAAAACCTTTAAATCCAG 4111 

1850 acgacttctggcgctgcacctccggcctcccctccctgatgaagaccccc 1899 

I II II III I I I I I I II II II II I I I II I I I II II 
4112 ATGATTTTTGGAGATGCACGTCTGGTCTTCCATCTTTAATGAAAACTCCA 4161 

1900 aagataaggctgatgcccgggcccggcctgctggccatgcccaccaccgt 1949 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
4162 AAAATAAGGTTAATGCCGGGGCCGGGATTATTAGCTATGCCAACGACTGT 4211 



1950 ggacggctgcgtgcgcaccccctccctggtgatcaacgacctgatctacg 1999 

II I I I I I II I II II III I II II II I I I I II I II I 
4212 TGATGGCTGTGTTAGAACTCCGTCCTTAGTTATAAATGATCTGATTTATG 4261 

2000 cctacacctccaacctgatcacccgcggctgccaggacatcggcaagtcc 2049 

I II I I I I I II II II II II II I I I I I I I I I I I I II II 
4 2 62 CTTATACCTCGAATCTAATTACTCGAGGTTGCCAGGATATAGGAAAATCA 4 311 

• * • • • 
2050 taccaggtgctgcagatcggcatcatcaccgtgaactccgacctggtacc 20 99 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

4 312 TATCAAGTATTACAGATAGGGATAATAACTGTAAACTCAGACTTGGTACC 4 361 

2100 cgacctgaacccccggatctcccacaccttcaacatcaacgacaacagga 214 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

4 362 TGACTTAAATCCTAGGATCTCTCATACTTTCAACATAAATGACAATAG7VA 4 411 

• • * • • 
2150 agtcctgctccctggccctcctgaacaccgacgtgtaccagctgtgctcc 2199 

MM II II II II I I I I I I I II I II II II II I I II I II 
4 412 AGTCATGTTCTCTAGCACTCCTAAACACAGATGTATATCAACTGTGTTCG 44 61 

• • * • . 
2200 acgcccaaggtggacgagcgctccgactacgccagctccggcatcgagga 2249 

II I M II II I I I I I I I I I I I I I I I I I I I I I I I I 

4 4 62 ACTCCCAAAGTTGATGAAAGATCAGATTATGCATCATCAGGCATAGAAGA 4 511 

2250 catcgtgctggacatcgtcaaccacgacggctccatctccaccacccgct 2299 

II II II II II I I I I I II II II II I I I I I II II I I 

4 512 TATTGTACTTGATATtGTCAATCATGATGGTTCAATCTCAACAACAAGAT 4 561 
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2300 tcaagaacaacaacatcagcttcgaccagccctacgccgccctgtacccc 234 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

4 562 TTAAGAACAATAATATAAGTTTTGATCAACCATATGCGGCATTATACCCA 4 611 

2350 tccgtgggccccggcatctactacaagggcaagatcatcttcctgggcta 2399 

II II II II II II I I I I I I I I I I I I I II II II II II II 

4 612 TCTGTTGGACCAGGGATATACTACAAAGGCAAAATAATATTTCTCGGGTA 4 661 

2400 cggcggcctggagcaccccatcaacgagaacgccatctgcaacaccaccg 2449 

II II II II II II II II I I I I I II I I I I I I I I I I I II I 
4 662 TGGAGGTCTTGAACATCCAATAAATGAGAATGCAATCTGCAACACAACTG 4 711 

2450 ggtgccccggcaagacccagcgggactgcaaccaggcctcccacagcccc 2499 

I I I I I I I I I II II III I I I I I I I I I I I I I I II II II II 
4712 GGTGTCCCGGGAAAACGCAGAGAGACTGCAATCAGGCATCTCATAGTCCT 4 761 

• • • > • 
2500 tggttctccgaccgccgcatggtgaactccatcatcgtggtggacaaggg 254 9 

I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II 
4 7 62 TGGTTTTCAGACAGAAGGATGGTCAACTCCATTATTGTTGTTGACAAGGG 4 811 

2550 cctgaactccatccccaagctgaaggtgtggaccatctccatgcggcaga 2599 

I I I I I I I II II II I I I I I I I I I I I I I II I I I I I I I II I 
4 812 CTTAAACTCAATTCCAAAACTGAAGGTATGGACGATATCCATGAGACAAA 4 861 

2600 actactggggctccgagggccgcctgctgctgctgggcaacaagatctac 264 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

4 8 62 ATTACTGGGGGTCAGAAGGAAGGCTACTTCTACTAGGTAACAAGATCTAT 4 911 

2 650 atctacacccgctccaccagctggcacagcaagctgcaactgggcatcat 2 699 

II II II I II II II I I I I I I I I I I I I if I II II II 

4 912 ATATATACAAGATCTACAAGTTGGCATAGCAAGTTACAATTAGGAATAAT 4 961 

2700 cgacatcaccgactacagcgacatccgcatcaagtggacctggcacaacg 274 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
4 962 TGATATTACTGATTACAGTGATATAAGAATAAAATGGACATGGCATAATG 5011 

• • • > • 
2750 tgctgagccggcccggcaacaacgagtgcccctggggccactcctgcccc 2799 

Mil I II II I II I I II II II II I I I II II II I I I 

5012 TGCTATCAAGACCAGGAAACAATGAATGTCCATGGGGACATTCATGCCCA 5061 

2800 gacggctgcatcaccggcgtgtacaccgacgcctaccccctgaaccccac 284 9 

II II II II II II II II II II II II II II II I I I I I 
5062 GATGGATGTATAACAGGAGTATATACTGATGCATATCCACTCAATCCCAC 5111 

2850 cggcagcatcgtgagctccgtgatcctggactcccagaagtcccgggtga 2899 

I I II I II II I I I I I I I I I I I I I I I I I I I I I I I 
5112 AGGGAGCATTGTGTCATCTGTCATATTAGACTCGCAAAAATCGAGAGTAA 5161 
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2900 accccgtgatcacctacagcacctccaccgagcgcgtgaacgagctggcc 2949 

I I I I I I I I t I I I I II I I I I I I I I I II I I I I I I I I I I 
5162 ACCCAGTCATAACTTACTCAACATCAACTGAAAGGGTAAACGAGCTGGCC 5211 

2950 atccgcaacaagaccctgagcgccggctacaccaccaccagctgcatcac 2999 

I I I I I I I I I I II II II II II II II II I I I I I I I I II 
5212 ATCCGAAACAAAACACTCTCAGCTGGATATACAACAACGAGCTGCATTAC 5261 

• • • * • 

3000 ccactacaacaagggctactgcttccacatcgtggagatcaaccacaaga 304 9 

I I I I I I I I I I II II II II II II II II M II II II I 
5262 ACACTATAACAAAGGATATTGTTTTCATATAGTAGAAATAAATCATAAAA 5311 

3050 gcctggacaccttccagcccatgctgttcaagaccgagatccccaagagc 3099 

II I I II I I I I I I I II II I I I I I I I I II I I I I I II II III 
5312 GCTTAGACACATTCCAACCTATGTTGTTCAAAACAGAGATTCCAAAAAGC 5361 

3100 tgcagctaaGAAT 3112 

I I I I I I I I I I 
5362 TGCAGTTAATCAT 5374 
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Fig. 33B 

(Linear) MAP of: FrsvHNpiv3 . seq check: 7448 from: 1 to: 
3090 nucleic acids sequence of FrsvHNpiv3 (non humanised) 

atggagttgctaatcctcaaaacaaatgcaattaccgcaatccttgctgc 
agtcacactctgttttgcttccagtcaaaacatcactgaagaattttatc 
aatcaacatgcagtgcagtcagcaaaggctatcttagtgctctaagaact 
ggttggtatactagtgttataactatagaattaagtaatatcaaggaaaa 
taagtgtaatggaacagacgctaaggtaaaattgataaaacaagaattag 
ataaatataaaagtgctgtaacagaattgcagttgctcatgcaaagcaca 
ccggcaaccaacaatcgagccagaagagaactaccaaggtttatgaatta 
tacactcaacaataccaaaaataccaatgtaacattaagcaagaaaagga 
aaagaagatttcttggctttttgttaggtgttggatctgcaatcgccagt 
ggcattgctgtatctaaggtcctgcacctagaaggggaagtgaacaaaat 
caaaagtgctctactatccacaaacaaggctgtagtcagcttatcaaatg 
gagttagtgtcttaaccagcaaagtgttagacctcaaaaactatatagat 
aaacagttgttacctattgtgaacaagcaaagctgtagcatatcaaacat 
tgaaactgtgatagagttccaacaaaagaacaacagactactagagatta 
ccagggaatttagtgttaatgcaggtgtaactacacctgtaagcacttat 
atgttaacaaatagtgaattattatcattaatcaatgatatgcctataac 
aaatgatcagaaaaagttaatgtccaacaatgttcaaatagttagacagc 
aaagttactctatcatgtccataataaaggaggaagtcttagcatatgta 
gtacaattaccactatatggtgtaatagatacaccttgttggaaactgca 
cacatcccctctatgtacaaccaacacaaaggaagggtccaacatctgtt 
taacaagaaccgacagaggatggtactgtgacaatgcaggatcagtatct 
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ttcttcccactagctgaaacatgtaaagttcaatcgaatcgagtattttg 
tgacacaatgaacagtttaacattaccaagtgaagtaaatctctgcaaca 
ttgacatattcaaccccaaatatgattgcaaaattatgacttcaaaaaca 
gatgtaagcagctccgttatcacatctctaggagccattgtgtcatgcta 
tggcaaaactaaatgtacagcatccaataaaaatcgtggaatcataaaga 
cattttctaacgggtgtgattatgtatcaaataagggggtggacactgtg 
tctgtaggtaatacattatattatgtaaataagcaagaaggcaaaagtct 
ctatgtaaaaggtgaaccaataataaatttctatgacccattagtgttcc 
cctctgatgaatttgatgcatcaatatctcaagtcaatgagaagattaac 
cagagcctagcatttattcgtaaatccgatgaattattacataatgtaaa 
tgctggtaaatccaccacaaatatcatgAACAATGAGTTTATGGAAGTTA 
CAGAAAAGATCCAAATGGCATCGGATAATATTAATGATCTAATACAGTCA 
GGAGTGAATACAAGGCTTCTTACAATTCAGAGTCATGTCCAGAATTATAT 
ACCaATATCATTGACACAACAAATGTCGGATCTTAGGAAATTCATTAGTG 
AAATTACAATTAGGAATGATAATCAAGAAGTGCCTCCACAAAGAATAACA 
CATGATGTGGGCATAAAACCTTTAAATCCAGATGATTTTTGGAGATGCAC 
GTCTGGTCTTCCATCTTTAATGAAAACTCCAAAAATAAGGTTAATGCCGG 
GGCCGGGATTATTAGCTATGCCAACGACTGTTGATGGCTGTGTTAGAACT 
CCGTCCTTAGTTATAAATGATCTGATTTATGCTTATACCTCaAATCTAAT 
TACTCGAGGTTGCCAGGATATAGGAAAATCATATCAAGTATTACAGATAG 
G G AT AAT AAC T G T AAAC T C AG AC T T G G T ACC TG AC T T AAAT C C TAG GAT C 
TCTCATACTTTCAACATAAATGACAATAGAAAGTCATGTTCTCTAGCACT 
CCTAAAtACAGATGTATATCAACTGTGTTCGACTCCCAAAGTTGATGAAA 
GATCAGATTATGCATCATCAGGCATAGAAGATATTGTACTTGATATtGTC 
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AATCATGATGGTTCAATCTCAACAACAAGATTTAAGAACAATAATATAAG 
TTTTGATCAACCATATGCGGCATTATACCCATCTGTTGGACCAGGGATAT 
ACTACAAAGGCAAAATAATATTTCTCGGGTATGGAGGTCTTGAACATCCA 
ATAAATGAGAATGCAATCTGCAACACAACTGGGTGTCCCGGGAAAACGCA 
GAGAGACTGCAATCAGGCATCTCATAGTCCcTGGTTTTCAGACAGAAGGA 
TGGTCAACTCCATTATTGTTGTTGACAAGGGCTTAAACTCAATTCCAAAA 
CTGAAGGTATGGACGATATCCATGAGACAAAATTACTGGGGGTCAGAAGG 
7VAGGCTACTTCTACTAGGTAACAAGATCTATATATATACAAGATCTACAA 
GTTGGCATAGC/^AGTTACAATTAGGAATAATTGATATTACTGATTACAGT 
GATATAAGAATAAAATGGACATGGCATAATGTGtTATCAAGACCAGGAAA 
CAATGAATGTCCATGGGGACATTCATGtCCAGATGGATGTATTVACAGGAG 
TATATACTGATGCATATCCgCTCAATCCCACAGGGAGCATTGTGTCATCT 
GTCATATTAGACTCGC7VAAAATCGAGAGTAAACCCAGTCATAACTTACTC 
AACAtCAACTGAAAGGGTAAACGAGCTGGCCATCCGAAACAAAACACTCT 
CAGCTGGATATACAACAACGAGCTGCATTACACACTATAACAAAGGATAT 
TGTTTTCATATAGTAGAAATAAATCATAAAAGCTTAGACACATTCCAACC 
TATGTTGTTCAAAACAGAGATTCCAAAAAGCTGCAGTTAA 
[SEQ ID NO: 55] 
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Fig. 33C 

(Linear) MAP of: FrhumHNphum. seq check: 9920 from: 1 to: 
3090 Humanised nucleic acids sequence of FRSVHNPiV3 

GGGTGGTACACtAGtGTGATCACCATCGAGCTGAGCAACATCAAGGAGAA 
CAAGTGCAACGGCACCGACGCCAAGGTGAAGCTGATCAAGCAGGAGCTGG 
ACAAGTACAAGAGCGCCGTGACCGAGCTGCAGCTGCTGATGCAGAGCACC 
CCCGCCACCAACAACagaGCCAGGCGCGAGCTGCCCAGGTTCATGAACTA 
CACCCTCAACAACACCAAGAACACCAACGTGACCCTGAGCAAGAAGcGcA 
AGaggCGcTTCCTGGGCTTCCTGCTGGGCGTGGGCTCCGCCATCGCCAGC 
GGCATCGCGGTGTCCAAGGTCCTGCACCTGGAGGGGGAGGTGAACAAGAT 
CAAGAGCGCCCTGCTCTCCACCAACAAGGCGGTGGTCAGCCTGTCCAACG 
GCGTGAGCGTGCTGACCAGCAAGGTGCTGGACCTCAAGAACTACATCGAC 
AAGCAatTGCTCCCCATCGTGAACAAGCAGtcCTGCAGCATCTCTAACAT 
TGAGACCGTGATCGAGTTCCAGCAGAAGAACAACAGGCTGCTGGAGATCA 
CCAGGGAGTTCAGCGTGAACGCgGGcGTcACCACCCCGGTGAGCACCTAC 
ATGCTGACCAACAGCGAGCTGCTGTCCCTGATCAACGACATGCCCATCAC 
CAACGACCAGAAGAAGCTtATGTCCAACAACGTGCAGATCGTGCGCCAGC 
AGAGCTACagCATCATGagCATCATCAAGGAGGAGGTGCTGGCCTACGTG 
GTGCAGCTGCCCCTGTACGGCGTGATCGACACCCCCTGCTGGAAGCTGCA 
CACCTCCCCCCTGTGCACCACCAACACCT^AGGAGGGCTCCAACATCTGCC 
TGACCCGCACCGACCGGGGCTGGTACTGCGACAACGCCGGCTCCGTGTCC 
TTCTTCCCCCTGGCGGAGACCTGCAAGGTGCAGTCCAACCGCGTGTTCTG 
CGACACCATGAACAGCCTGACCCTGCCCAGCGAGGTGAACCTCTGCAACA 
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TCGACATCTTCAACCCCAAGTACGACTGCAAGATtATGacctccaagacc 
gacgtgagcagctccgtgatcacctccctgggcgccatcgtgtcctgcta 
cggcaagaccaagtgtacagcctccaacaagaaccgcggcatcatcaaga 
ccttctccaacggctgcgactacgtgtccaacaagggcgtggacaccgtg 
tccgtgggcaacaccctgtactacgtgaacaagcaggagggcaagagcct 
gtacgtgaagggcgagcccatcatcaacttctacgacccgctggtgttcc 
cctccgacgagttcgacgcctccatctcccaggtgaacgagaagatcaac 
cagagcctggccttcatccgcaagtccgacgagctgctgcacaacgtgaa 
cgccggcaagtccaccaccaacatcatgaacaacgagttcatggaggtga 
ccgagaagatccagatggcctccgacaacatcaacgacctgatccagtcc 
ggcgtgaacacccggctgctgaccatccagagccacgtgcagaactacat 
ccccatctccctgacccagcagatgtccgacctgcggaagttcatcagcg 
agat caeca tccggaacgacaaccaggaggtgcccccccagaggatcacc 
cacgacgtgggcataaagcccctgaaccccgacgacttctggcgctgcac 
ctccggcctcccctccctgatgaagacccccaagataaggctgatgcccg 
ggcccggcctgctggccatgcccaccaccgtggacggctgcgtgcgcacc 
ccctccctggtgatcaacgacctgatctacgcctacacctccaacctgat 
cacccgcggctgccaggacatcggcaagtcctaccaggtgctgcagatcg 
gcatcatcaccgtgaactccgacctggtacccgacctgaacccccggatc 
tcccacaccttcaacatcaacgacaacaggaagtcctgctccctggccct 
cctgaacaccgacgtgtaccagctgtgctccacgcccaaggtggacgagc 
gctccgactacgccagctccggcatcgaggacatcgtgctggacatcgtc 
aaccacgacggctccatctccaccacccgcttcaagaacaacaacatcag 
cttcgaccagccctacgccgccctgtacccctccgtgggccccggcatct 
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actacaagggcaagatcatcttcctgggctacggcggcctggagcacccc 
atcaacgagaacgccatctgcaacaccaccgggtgccccggcaagaccca 
gcgggactgcaaccaggcctcccacagcccctggttctccgaccgccgca 
tggtgaactccatcatcgtggtggacaagggcctgaactccatccccaag 
ctgaaggtgtggaccatctccatgcggcagaactactggggctccgaggg 
ccgcctgctgctgctgggcaacaagatctacatctacacccgctccacca 
gctggcacagcaagctgcagctgggcatcatcgacatcaccgactacagc 
gacatccgcatcaagtggacctggcacaacgtgctgagccggcccggcaa 
caacgagtgcccctggggccactcctgccccgacggctgcatcaccggcg 
tgtacaccgacgcctaccccctgaaccccaccggcagcatcgtgagctcc 
gtgatcctggactcccagaagtcccgggtgaaccccgtgatcacctacag 
cacctccaccgagcgcgtgaacgagctggccatccgcaacaagaccctga 
gcgccggctacaccaccaccagctgcatcacccactacaacaagggctac 
tgcttccacatcgtggagatcaaccacaagagcctggacaccttccagcc 
catgctgttcaagaccgagatccccaagagctgcagctaa 

[SEQ ID NO : 56] 
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Fig. 34A 
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Fig. 34B: Humanization impact on the level of expression of F^HNj 
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Fig 35: Codon usage of F MuV H Mv and highly expressed human genes (hum high exp) 
The frequencies (xlOO) of the individual codons are shown for each of the 
degenerately encoded amino acids, and the most prevalent codon is shown in bold. 
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Fig 36: Schematic diagram of the PCR synthesis of each fragment in which X and 
Y are restriction sites that allow retrieval of the full size fragment from the cloning 
vector. 
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Fig 37 : Sequence of the 12 oligonucleotides from which PCR fragment A was 
generated. 



l)oli 1 FmuvHmv 1-98, horn ARN 

ggtctagaccaccATGAAGGCGTTCCCCGTGATCTGCCTGGGCTTCGCCATCTTCTCCAG 

+ + + + + + 60 

CAGCATCTGCGTGAACATCAACATCCTGCAGCAGATCG 
+ + + 98 



2) oli 2 FmuvHmv 82-181, inv corap ARN 

GTTGGGCAGCAGCTTGACCACCACGTAGGAGCTGGAGCTCTGGGAGTAGTAGCTCAGCTG 

+ + + + + + 60 

CCTCACCTGCTGCTTGATGTATCCGATCTGCTGCAGGATG 
+ + + + 100 



3) oli 3 FmuvHmv, 166-264 horn ARN 

CAAGCTGCTGCCCAACATCCAGCCCACCGACAACAGCTGCGAGTTCAAGAGCGTGACCCA 

+ + + + + + 60 

GTACAACAAGACCCTGAGCAACCTGCTGCTGCCCATCGC 
+ + + 99 



4) oli 4 FmuvHmv, 250-352, inv compARN 

CAGGGCGGCGATGCCGATGGCGATGCCGGCGAACCGCTTGTGCCGCCGGGAGCCGGGGGA 

+ + + + + + 60 

GGGGGAGGTGATGTTGTTGATGTTCTCGGCGATGGGCAGCAGC 
+ + + +— 103 



5) oli 5 FmuvHmv, 338-441, horn ARN 

GGCATCGCCGCCCTGGGCGTGGCCACCGCCGCCCAGGTGACCGCCGCCGTGTCCCTGGTG 

+ + + + + + 60 

CAGGCCCAGACCAACGCCCGCGCCATCGCCGCCATGAAGAACTC 
+ + + + 104 



6) oli 6 FMuvHmv, 427-523, inv comp ARN 

GTCCTGGATGGCCTGCACGGCGATGGCCAGCTGCTGGGTGCCCTCCTTCACCTCGAACAC 

+ + + + + + 60 

GGCGCGGTTGGTGGCCTGGATGGAGTTCTTCATGGCG 
+ + + 97 



7) oli 7 FMUVHM 509-610, horn ARN 

CAGGCCATCCAGGACCACATCAACACCATCATGAACACCCAGCTGAACAACATGTCCTGC 

+ — + + + + + 60 

CAGATCCTGGACAACCAGCTGGCCACCTCCCTGGGCCTGTAC 
+ + + +— 102 



WO 00/18929 



PCT/EP99/07004 



52/73 

8) oli 8 FMUHM, 595-691, inv comp ARN 

GGACCGCAGGGCCTGGATACTGATGGGggaCAGGGCGGGGTTGATCAGCTGGGGCTGGAA 

+ + + + + + 60 

CACGGTGGTCAGCTCGGTCAGGTACAGGCCCAGGGAG 

. + + 97 



9) oli 9, 677-778, horn ARN 

CAGGCCCTGCGGTCCCTGCTGGGCAGCATGACCCCCGCCGTGGTGCAGGCCACCCTGAGC 

+ + + + + + 60 

ACCTCCATCAGCGCCGCCGAGATCCTGAGCGCCGGCCTGATG 



10) oli 10, FmuvHmv, 763-862, inv comp ARN 

GTTGGACTGGGTCACGATGGTGGGCACGTTGATCTTCACGATCATCTGCATCTCGTCCAG 

+ + + + + + 60 

CAGCACGGACACGATCTGGCCCTCCATCAGGCCGGCGCTC 

. + + + 100 



11) oli 11, FmuvHmv, 848-949, homARN 

GTGACCCAGTCCAACGCCCTGGTGATCGACTTCTACAGCATCAGCAGCTTCATCAACAAC 
+ 

CAGGAGTCCATCATCCAGCTGCCCGACCGCATCCTGGAGATC 



+ + + + + 60 



-+— 102 



12) oli 12 FMUHM, 935-1039, inv compARN 

GCTCAGCCGCTCGGCCTCGTTGTACTGGCAGAAGATGTGGTGGCGGGTCAGCTTGCAGTT 

+ + + + + + 60 

CTTGGCGGGGTAGCGCCACTGCTCGTTGCCGATCTCCAGGATGCG 

_ . + + + 105 



[SEQ ID NOS: 57-68 respectively] 
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Fig. 38 : Sequence of the 9 oligonucleotides from which PCR fragment B was 
generated. 

12) oli 12 FMUHM, 935-1039, inv compARN 

GCTCAGCCGCTCGGCCTCGTTGTACTGGCAGAAGATGTGGTGGCGGGTCAGCTTGCAGTT 

+ + + + + + 60 

CTTGGCGGGGTAGCGCCACTGCTCGTTGCCGATCTCCAGGATGCG 
+ +— r- + + 105 



13) oli 13, 1025-1129, horn ARN 

GCCGAGCGGCTGAGCCTGGAGACCAAGCTGTGCCTGGCCGGCAACATCAGCGCCTGCGTG 

+ + + + + + 60 

TTCTCCAGCATCGCCGGCAGCTACATGCGCCGCTTCGTGGCCCTG 
+ + + + 105 



14) oli 14, 1115-1216, inv comp ARN 

GGCGTGGTGGTCGGGCTGGTAGATGGGGTAGGAGGGGCTCTTGCACAGGCAGGTCAGGCT 

+ + + + + + 60 

GCGGCAGTTGGCCACGATGGTGCCGTCCAGGGCCACGAAGCG 
+ + + +— 102 



15) oli 15, 1202-1299, hom ARN 

CCCGACCACCACGCCGTGACCACCATCGACCTGACCTCCTGCCAGACCCTGAGCCTGGAC 

+ + + + + + 60 

GGCCTGGACTTCAGCATCGTGTCCCTGAGCAACATCAC 
+ + + 98 



16) oli 16 1285-1387, inv comp ARN 

CTTGCTCAGCTCGGTGGAGATGTCGATGGGCTGGGTGTTGATGGTCTGGCTCAGGCTGAT 

+ + + + + + 60 

GGTCAGGTTCTCGGCGTAGGTGATGTTGCTCAGG 
+ + + 94 



17) oli 17, 1363-1462, hom ARN 

CACCGAGCTGAGCAAGGTGAACGCCTCCCTGCAGAACGCCGTGAAGTACATCAAGGAGAG 

+ + + + + + 60 

CAACCACCAGCTGCAGAGCGTGAGCGTGAGCAGCAAGCGC 
+ + + + 100 



18) oli 18, 1447-1550, inv comp ARN 

TCACCTGGTGCTCGATGGAGTTGGTCACGTCCAGGTTGGTGCTCAGGCTCTTGTGGATCT 

+ + + + + + 60 

CGGCGGTGTAGATGGCGGCGCGGTGCAGGCGCTTGCTGCTCACG 
+ + + + 104 



WO 00/18929 



PCT/EP99/07004 



54/73 

19) oli 19, 1534-1636, hom ARN 

CATCGAGCACCAGGTGAAGGACGTGCTGACCCCCCTGTTCAAGATCATCGGCGACGAGGT 

+ + + + + + 60 

GGGCCTGCGCACCCCCCAGCGCTTCACCGACCTGGTGAAGTTC 
+ + + +— 103 



20) oli 20 FmuvHmv, 1622-1718, inv comp ARN 

GCTCGGGGGGGTTGATGCACCAGGTCAGGTCGCGGAAGTCGTACTCGCGGTCGGGGTTCA 

+ + + + + + 60 

GGAACTTGATCTTGTCGGAGATGAACTTCACCAGGTC 
+ + + 97 



[SEQ IDNOS: 69-77 respectively] 
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Fig. 39 : Sequence of the 11 oligonucleotides from which PCR fragment C was 
generated. 

20) oli 20 FmuvHmv, 1622-1718, inv comp ARN 

GCTCGGGGGGGTTGATGCACCAGGTCAGGTCGCGGAAGTCGTACTCGCGGTCGGGGTTCA 

+ + + + + + 60 

GGAACTTGATCTTGTCGGAGATGAACTTCACCAGGTC 
+ + + 97 

21) oli 21, FmuvHmv, 1701-1799, horn ARN 

GCATCAACCCCCCCGAGCGGATCAAGCTGGACTACGACCAGTACTGCGCCGACGTGGCCG 

+ + + + + + 60 

CCGAGGAGCTGATGAACGCCCTGGTGAACAGCACCCTGC 
+ + + 99 

22) oli 22,1784-1888, inv comp 

CATGTTGCTGAACTGGCCCCGGATGGTGGTGGGGCCGCTGCAGTTGCCCTTGCTCACGGC 

+ + + + + + 60 

CAGGAACTGGTTGGTGGTGCGGGTCTCCAGCAGGGTGCTGTTCAC 
+ + + + 105 

23) oli 23, 1874-1971, horn ARN 

CAGTTCAGCAACATGAGCCTGTCCCTGCTGGACCTGTACCTGGGCCGGGGCTACAACGTG 

+ + + + + + 60 

AGCAGCATCGTGACCATGACCAGCCAGGGCATGTACGG 
_ + + + 98 

24) oli 24, 1957-2057, inv. comp ARN 

CCACCTCGAACACGCGGTACATGCTCAGCTGGCTCAGCTCGCTCCGCTTGCTGCTCAGGT 
+ + + + + + 60 

TGGGCTTCTCCACCAGGTAGGTGCCGCCGTACATGCCCTGG 
+ + + +- 101 

25) oli 25, FmuvHmv, 2043-2140, homARN 

GCGTGTTCGAGGTGGGCGTGATCCGGAACCCCGGCCTGGGCGCCCCCGTGTTCCACATGA 

+ + + + + + 60 

CCAACTACCTGGAGCAGCCCGTGAGCAACGACCTGAGC 
+ + + 98 

26) oli26, FmuvHmv, 2125-2227, inv comp ARN 

GCCGCTGCCCTGGTAGGGGATGGTGATGCTGTCCTCGCCGTGGCACAGGGCGGCCAGCTT 

+ + + + + + 60 

CAGCTCGCCCAGGGCCACCATGCAGTTGCTCAGGTCGTTGCTC 
+ + + +— 103 

27) oli 27, 2212-2309, FmuvHm, horn ARN 

CTACCAGGGCAGCGGCAAGGGCGTGAGCTTCCAGCTGGTGAAGCTGGGCGTGTGGAAGAG 
+ + + + + + 60 

CCCCACCGACATGCAGAGCTGGGTGCCCCTGAGCACCG 
+ + + 98 
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28) oli 28,FmuvHmv, 2294-2392, inv comp ARN 

GGTGGGCACGGCCCACTTGGCCTGGTTGTCGGCGATCACGCCGCGGTGGCTGCTCAGGTA 

+ + + + + + 60 

CAGGCGGTCGATCACGGGGTCGTCGGTGCTCAGGGGCAC 
+ + + 99 



29) oli 29, Fmuv Hmv, 2377-2477, hom ARN 

GTGGGCCGTGCCCACCACCCGCACCGACGACAAGCTGCGCATGGAGACCTGCTTCCAGCA 

+ + + + + + 60 

GGCCTGCAAGGGCAAGATCCAGGCCCTGTGCGAGAACCCCG 
+ + + +- 101 



30) oli 30, FmuvHmv 2462-2561, inv comp 

TGATCTTCAGCTCCACGGTCAGGCTCAGGTCCACGCTCAGCACGCCGTAGCTGGGGATGC 

+ + + + + + 60 

GGTTGTCCTTCAGGGGGGCCCAtTCGGGGTTCTCGCACAG 
+ + + + 100 



[SEQ ID NOS: 78-88 respecively] 
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Fig. 40 : Sequence of the 8 oligonucleotides from which PCR fragment D was 
generated. 

30) oli 30/ FmuvHmv 24 62-2561, inv comp 

TGATCTTCAGCTCCACGGTCAGGCTCAGGTCCACGCTCAGCACGCCGTAGCTGGGGATGC 
+ + + + + + go 

GGTTGTCCTTCAGGGGGGCCCAtTCGGGGTTCTCGCACAG 
+ + + + 100 



31) oli31, FmuvHmv, 2546-2649, horn ARN 

GTGGAGCTGAAGATCAAGATCGCGAGCGGCTTCGGCCCCCTGATCACCCACGGCAGCGGC 
+ + + + + + go 

ATGGACCTGTACAAGAGCAACCACAACAACGTGTACTGGCTGAC 
+ + + + 104 



32) oli 32, FmuvHmv, 2635-2738, inv comp ARN 

CGGTGAACAGGTAGGGGCTCACCTTGAAGCGGGGaATCCACTCCAGGGTGTTGATCACGC 
+ + + + + + 60 

CCAGGGCCAGGTTCTTCATGGGGGGGATGGTCAGCCAGTACACG 
+ + + + 104 



33) oli 33, FmuvHmv, 2723-2827, horn ARN 

CCCTACCTGTTCACCGTGCCCATCAAGGAGGCCGGCGAGGACTGCCACGCCCCGACCTAC 
+ + + + + + 60 

CTGCCCGCCGAGGTGGACGGCGACGTGAAGCTGAGCAGCAACCTG 
+ + + + io5 



34) oli 34, FMUVHmv , 2813-2911, inv comp ARN 

CACGTAGTACACCACGGCGTGCTCCACGCGGCTGGTGTCGTAGGTGGCCAGCACGTACTG 

+ + + + + + 60 

CAGGTCCTGGCCGGGCAGGATCACCAGGTTGCTGCTCAG 



35) oli 35 FMUHM, 2897-2995, homARN 

GTGGTGTACTACGTGTACAGCCCCGGCCGCAGCTTCTTCTACTTCTACCCCTTCCGCCTG 
+ + + + + + 60 

CCCATCAAGGGCGTGCCCATCGAGCTGCAGGTGGAGTGC 
+ + + 99 



36) oli 36, FmuvHmv, 2981-3078, inv comp ARN 

CCGCTGTGGGTGATGTGGCCGCCGCTCTCGCTGTCGGCCAGCACGCAGAAGTGGCGGCAC 
+ + + + + + 60 

CACAGCTTCTGGTCCCAGGTGAAGCACTCCACCTGCAG 
+ + + 98 
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37) oli 37, 3064-3147, homARN 

CATCACCCACAGCGGCATGGTGGGCATGGGCGTGAGCTGCACCGTGACCCGCGAGGACGG 
„ + + + + + + 60 

CACCAACCGCCGCTAGcgaattcc 
— + + 84 



[SEQ ID NOS: 89-96 respectively] 
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Fig. 41 : Construction of pEE14F MuV hum HN MV hum 
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Figure 42A : Humanised nucleic acids sequence of F MuV H MV (upper 
sequence) compared to the original F MuV H MV sequence (lower sequence) and 
the corresponding amino acids sequence. 



• • • • ■ 

14 ATGAAGGCGTTCCCCGTGATCTGCCTGGGCTTCGCCATCTTCTCCAGCAG 63 
I I I 1! I I I II II II II III I I I I I I I II I I I I I II I 
1 ATGAAGGCTTTTCCAGTTATTTGCTTGGGCTTTGCAATCTTTTCATCCTC 50 

64 CATCTGCGTGAACATCAACATCCTGCAGCAGATCGGATACATCAAGCAGC 113 

II I! I I I I I I I I I I III I I I I I I I II I I I I I I I I I I I I I I I 
51 TATATGTGTGAATATCAATATCTTGCAGCAAATTGGATACATCAAGCAAC 100 

114 AGGTGAGGCAGCTGAGCTACTACTCCCAGAGCTCCAGCTCCTACGTGGTG 163 

I I I I I I I I I II Mill I I I I I II II II I I I I I I I I I I I III 
101 AGGTCAGGCAACTAAGCTATTACTCACAAAGTTCAAGCTCCTACGTAGTG 150 

164 GTCAAGCTGCTGCCCAACATCCAGCCCACCGACAACAGCTGCGAGTTCAA 213 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
151 GTCAAGCTTTTACCGAATATCCAACCCACTGATAACAGCTGTGAATTTAA 200 

214 GAGCGTGACCCAGTACAACAAGACCCTGAGCAACCTGCTGCTGCCCATCG 263 

Ml II II II I I II I M I II I II I I II II II II II II I 
201 GAGTGTAACTCAATACAATAAGACCTTGAGTAATTTGCTTCTTCCAATTG 250 

2 64 CCGAGAACATCAACAACATCACCTCCCCCTCCCCCGGCTCCCGGCGGCAC 313 

I I I II II I II I I I I I I I II I I I I I M I I I I I I I II 
251 CAGAAAACATAAACAATATTACGTCGCCCTCACCTGGGTCAAGACGTCAT 300 

314 AAGCGGTTCGCCGGCATCGCCATCGGCATCGCCGCCCTGGGCGTGGCCAC 363 

M I I I I I II I I 1 II Mill Mill II II II I II II II II 
301 AAACGGTTTGCTGGCATTGCCATTGGCATTGCgGCcCTCGGTGTTGCGAC 350 

364 CGCCGCCCAGGTGACCGCCGCCGTGTCCCTGGTGCAGGCCCAGACCAACG 413 

III II II II II I I II I I II II I II II li || I II II I 
351 CGCAGCACAAGTGACTGCCGCTGTCTCATTAGTTCAAGCACAGACAAATG 400 

414 CCCGCGCCATCGCCGCCATGAAGAACTCCATCCAGGCCACCAACCGCGCC 4 63 

I II II II II II Mill II II II I II II II II II II 

401 CACGTGCAATAGCAGCGATGAAAAATTCAATACAGGCAACTAATCGGGCA 4 50 

4 64 GTGTTCGAGGTGAAGGAGGGCACCCAGCAGCTGGCCATCGCCGTGCAGGC 513 

M I I I I I I I I II I I I I I I I I I I I II I I II II II II II II 
4 51 GTCTTCGAAGTGAAGGAAGGCACCCAACAGTTAGCTATAGCGGTACAAGC 500 

514 CATCCAGGACCACATCAACACCATCATGAACACCCAGCTGAACAACATGT 563 

II I I I I I II I I Mill II II I II I II II I II I M I II I I I I I 

501 cATcCAAGACCATATCAATACTATTATGAACACCCAATTGAACAATATGT 550 
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564 CCTGCCAGATCCTGGACAACCAGCTGGCCACCTCCCTGGGCCTGTACCTG 613 

I II I I I I I I I I II I I I I I I I I II I I I I I I I I II I I I I I I 
551 CTTGTCAGATCCTTGATAACCAGCTTGCAACCTCCCTAGGATTATACCTA 600 

614 ACCGAGCTGACCACCGTGTTCCAGCCCCAGCTGATCAACCCCGCCCTGtc 663 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I 1 I I I I 
601 ACAGAATTAACAACAGTGTTTCAGCCACAATTAATTAATCCAGCATTGTC 650 

664 cCCCATCAGTATCCAGGCCCTGCGGTCCCTGCTGGGCAGCATGACCCCCG 713 

II I I I I I I I II III M I I I I I I I I II II I I I I I II I 
651 ACCGATTAGTATACAAGCCTTGAGGTCTTTGCTTGGAAGTATGACACCTG 700 

714 CCGTGGTGCAGGCCACCCTGAGCACCTCCATCAGCGCCGCCGAGATCCTG 763 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

701 CAGTGGTTCAAGCAACATTATCTACTTCAATTTCTGCTGCTGAAATACTA 750 

764 AGCGCCGGCCTGATGGAGGGCCAGATCGTGTCCGTGCTGCTGGACGAGAT 813 

II Mill II I I I I I I I I I I I I I II II II I I I I I II I I I I I 

751 AGTGCCGGTCTAATGGAGGGTCAGATAGTTTCTGTTCTGCTAGATGAGAT 800 

814 GCAGATGATCGTGAAGATCAACGTGCCCACCATCGTGACCCAGTCCAACG 863 

I I I I I I I I I II I I I I I I I I II II I I I I I II II II II II I 
801 GCAGATGATAGTTAAGATAAACGTTCCAACCATTGTCACACAATCAAATG 850 

864 CCCTGGTGATCGACTTCTACAGCATCAGCAGCTTCATCAACAACCAGGAG 913 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

851 CATTGGTGATTGACTTCTACTCAATTTCGAGTTTTATTAATAATCAAGAA 900 

914 TCCATCATCCAGCTGCCCGACCGCATCCTGGAGATCGGCAACGAGCAGTG 963 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I 
901 TCCATAATTCAATTGCCAGACAGGATCTTGGAGATCGGAAATGAACAATG 950 

964 GCGCTACCCCGCCAAGAACTGCAAGCTGACCCGCCACCACATCTTCTGCC 1013 

I I I I I I II II I I I I I II II I I I I I I I I I I I I I I I I I I I II 
951 GCGCTATCCAGCTAAGAATTGTAAGTTGACAAGACACCACATATTCTGCC 1000 

1014 AGTACAACGAGGCCGAGCGGCTGAGCCTGGAGACCAAGCTGTGCCTGGCC 1063 

I I I I I I I I I I I III I I I I I I I I II II II II II I I I I I II 
1001 AATACAATGAGGCAGAGAGGCTGAGCCTAGAAACAAAACTATGCCTTGCA 1050 

1064 GGCAACATCAGCGCCTGCGTGTTCTCCAGCATCGCCGGCAGCTACATGCG 1113 

I I I I I If II I I I I I I I I I I I I I II II II II It I II I 

1051 GGCAATATTAGTGCCTGTGTGTTCTCATCTATAGCAGGGAGTTATATGAG 1100 

1114 CCGCTTCGTGGCCCTGGACGGCACCATCGTGGCCAACTGCCGCAGCCTGA 1163 

II II II II I I I I I II II II II II I I II I II II II I 
1101 GCGATTTGTAGCACTGGATGGAACAATTGTTGCAAACTGTCGAAGTCTAA 1150 

1164 CCTGCCTGTGCAAGAGCCCCTCCTACCCCATCTACCAGCCCGACCACCAC 1213 

I II II MINIM II II II || M | | | I I II I I I I I II 
1151 CGTGTCTATGCAAGAGTCCATCTTATCCTATATACCAACCTGACCATCAT 1200 
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1214 GCCGTGACCACCATCGACCTGACCTCCTGCCAGACCCTGAGCCTGGACGG 1263 

I.I I I I I ! I I I I 11 I I I I I I I I I I I I II I I I I I I I I I 
1201 GCAGTCACGACCATTGATCTAACGTCATGTCAAACATTGTCCCTGGACGG 1250 

1264 CCTGGACTTCAGCATCGTGTCCCTGAGCAACATCACCTACGCCGAGAACC 1313 

I I I I I I I I I I I I I II II II I I I I I I I I I I I Mill Mill I 
1251 ACTGGATTTCAGCATTGTCTCGCTAAGCAACATCACTTACGCTGAGAATC 1300 

1314 TGACCATCAGCCTGAGCCAGACCATCAACACCCAGCCCATCGACATCTCC 1363 

I II II II I II I I .1 I I II II II I I II I II II II 

1301 TTACTATTTCATTGTCTCAGACAATCAATACTCAACCCATTGATATATCA 1350 

1364 ACCGAGCTGAGCAAGGTGAACGCCTCCCTGCAGAACGCCGTGAAGTACAT 1413 

I I M I I I I I I I I I I I II II II I I I II II I I I I I II I I I I I 
1351 ACTGAGCTGAGTAAGGTTAATGCATCCCTCCAAAATGCCGTTAAATACAT 14 00 

1414 CAAGGAGAGCAACCACCAGCTGCAGAGCGTGAGCGTGAGCAGCAAGCGCC 14 63 

I I I M II I I II I I I II I I I I I I I I I I I II II 

14 01 AAAAGAGAGTAACCATCAACTCCAATCCGTTAGTGTAAGTTCTAAAAGAC 1450 

14 64 TGCACCGCGCCGCCATCTACACCGCCGAGATCCACAAGAGCCTGAGCACC 1513 

I II M II I I II ! II I I M I I I II I I I I I I II I I I I I I I I I I I 

14 51 TTCATCGGGCAGCCATCTACACCGCAGAGATCCATAAAAGCCTCAGCACC 1500 

1514 AACCTGGACGTGACCAACTCCATCGAGCACCAGGTGAAGGACGTGCTGAC 1563 

II M II II M I I II I I I I M I I I I I I I I I I I I II I I M I I I I 
1501 AATCTAGATGTAACTAACTCAATCGAGCATCAGGTCAAGGACGTGCTGAC 1550 

1564 CCCCCTGTTCAAGATCATCGGCGACGAGGTGGGCCTGCGCACCCCCCAGC 1613 

II II II I II II II II II II M I II I I I I I I I 11 II III 

1551 ACCACTCTTCAAAATCATCGGTGATGAAGTGGGCCTGAGGACACCTCAGA 1600 

1614 GCTTCACCGACCTGGTGAAGTTCATCTCCGACAAGATCAAGTTCCTGAAC 1663 

I II I II I I I I I I II II I II I I I I I I I I I I I I I II II I I I II 

1601 GATTCACTGACCTAGTGAAATTCATCTCTGACAAGATTAAATTCCTTAAT 1650 

1664 CCCGACCGCGAGTACGACTTCCGCGACCTGACCTGGTGCATCAACCCCCC 1713 

II II I II I I I I II I I I I I II II II I I I I I I I I I I I I I II 
1651 CCGGATAGGGAGTACGACTTCAGAGATCTCACTTGGTGTATCAACCCGCC 1700 

1714 CGAGCGGATCAAGCTGGACTACGACCAGTACTGCGCCGACGTGGCCGCCG 1763 

I I I I II I II I I I 1 I I I I I I I II I I II I I II I II I I I 
1701 AGAGAGAATCAAATTGGATTATGATCAATACTGTGCAGATGTGGCTGCTG 1750 

1764 AGGAGCTGATGAACGCCCTGGTGAACAGCACCCTGCTGGAGACCCGCACC 1813 

I I I I I I I I I I I I I I I I I II I I II I I I I I M I I I I I I I 
1751 AAGAGCTCATGAATGCATTGGTGAACTCAACTCTACTGGAGACCAGAACA 1800 

1814 ACCAACCAGTTCCTGGCCGTGAGCAAGGGCAACTGCAGCGGCCCCACCAC 1863 

I I I I I II I I I I I I II II I I II I I II I I I M I I I I I II 

1801 ACCAATCAGTTCCTAGCTGTCTCAAAGGGAAACTGCTCAGGGCCCACTAC 1850 
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1864 CATCCGGGGCCAGTTCAGCAACATGAGCCTGTCCCTGCTGGACCTGTACC 1913 

III I II II III I I I I I I I I I I I I I I I I III I I I I 
1851 AATCAGAGGTCAATTCTCAAACATGTCGCTGTCCCTGTTAGACTTGTATT 1900 

1914 TGGGCCGGGGCTACAACGTGAGCAGCATCGTGACCATGACCAGCCAGGGC 1963 

I II II II I I II I Ml II II I I I I I I I I I I I I I 

1901 TAGGTCGAGGTTACAATGTGTCATCTATAGTCACTATGACATCCCAGGGA 1950 

1964 ATGTACGGCGGCACCTACCTGGTGGAGAAGCCCAACCTGAGCAGCAAGCG 2013 

I I I I I I I II II Mill I I I I I I I I I I II I I I I I I I I I I I I 
1951 ATGTATGGGGGAACTTACCTAGTGGAAAAGCCTAATCTGAGCAGCAAAAG 2000 

2014 GAGCGAGCTGAGCCAGCTGAGCATGTACCGCGTGTTCGAGGTGGGCGTGA 2063 

I ill II II I I I I I I I I I II I I I I I I I I II II II II I 
2001 GTCAGAGTTGTCACAACTGAGCATGTACCGAGTGTTTGAAGTAGGTGTTA 2050 

2064 TCCGGAACCCCGGCCTGGGCGCCCCCGTGTTCCACATGACCAACTACCTG 2113 

I I I I I I I I I I 1 I I I I I I I I I I I I I I I I t I I I I I I I I I 
2051 TCAGAAATCCGGGTTTGGGGGCTCCGGTGTTCCATATGACAAACTATCTT 2100 

2114 GAGCAGCCCGTGAGCAACGACCTGAGCAACTGCATGGTGGCCCTGGGCGA 2163 

I I I I I II II II II II II I I I I I I I I IIMIIM I I I I II 
2101 GAGCAACCAGTCAGTAATGATCTCAGCAACTGTATGGTGGCTTTGGGGGA 2150 

2164 GCTGAAGCTGGCCGCCCTGTGCCACGGCGAGGACAGCATCACCATCCCCT 2213 

III II II II I I I I I II II I I I II II I I I I I II I I I I 

2151 GCTCAAACTCGCAGCCCTTTGTCACGGGGAAGATTCTATCACAATTCCCT 2200 

2214 ACCAGGGCAGCGGCAAGGGCGTGAGCTTCCAGCTGGTGAAGCTGGGCGTG 2263 

I I II M II II II II I I I I I I I I I I I II Mill II II 
2201 ATCAGGGATCAGGGAAAGGTGTCAGCTTCCAGCTCGTCAAGCTAGGTGTC 2250 

2264 TGGAAGAGCCCCACCGACATGCAGAGCTGGGTGCCCCTGAGCACCGACGA 2313 

Mill II I I I I I I I I I I I I II I I I I I I I I II I I I I 

2251 TGGAAATCCCCAACCGACATGCAATCCTGGGTCCCCTTATCAACGGATGA 2300 

2314 CCCCGTGATCGACCGCCTGTACCTGAGCAGCCACCGCGGCGTGATCGCCG 2363 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

2301 TCCAGTGATAGACAGGCTTTACCTCTCATCTCACAGAGGTGTTATCGCTG 2350 

2364 ACAACCAGGCCAAGTGGGCCGTGCCCACCACCCGCACCGACGACAAGCTG 2413 

I I I I I II II II I I I I I II II II II II II II I I I II I II 
2351 ACAAcCAAGCAAAATGGGCTGTCCCGACAACACGAACAGATGACAAGTTG 2400 

2 414 CGCATGGAGACCTGCTTCCAGCAGGCCTGCAAGGGCAAGATCCAGGCCCT 24 63 

II I I I I I I II I I I I I I I I I II II II I I I I I II M I M II II 
2401 CGAATGGAGACATGCTTCCAACAGGCGTGTAAGGGTAAAATCCAAGCACT 2450 



2464 GTGCGAGAACCCCGAaTGGGCCCCCCTGAAGGACAACCGCATCCCCAGCT 2513 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I 
2451 CTGCGAGAATCCCGAGTGGGCACCATTGAAGGATAACAGGATTCCTTCAT 2500 
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2514 ACGGCGTGCTGAGCGTGGACCTGAGCCTGACCGTGGAGCTGAAGATCAAG 2563 

I I I I I I II II II I I I I I I I I I I I I I I I I I II I I I I I 
2501 ACGGGGTCTTGTCTGTTGATCTGAGTCTGACAGTTGAGCTTAAAATCAAA 2550 

2564 ATCGCGAGCGGCTTCGGCCCCCTGATCACCCACGGCAGCGGCATGGACCT 2613 

Mil I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
2551 ATTGCTTCGGGATTCGGGCCATTGATCACACACGGTTCAGGGATGGACCT 2600 

• • • • • 

2 614 GTACAAGAGCAACCACAACAACGTGTACTGGCTGACCATCCCCCCCATGA 2663 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II Mil 
2601 ATACAAATCCAACCACAACAATGTGTATTGGCTGACTATCCCGCCAATGA 2650 

2664 AGAACCTGGCCCTGGGCGTGATCAACACCCTGGAGTGGATtCCCCGCTTC 2713 

I I I I I I I III I II II I I I I I I I I I I I I I I I I I I II I Ml 
2 651 AGAACCTAGCCTTAGGTGTAATCAACACATTGGAGTGGATACCGAGATTC 2700 

2714 AAGGTGAGCCCCTACCTGTTCACCGTGCCCATCAAGGAGGCCGGCGAGGA 2763 

I I I I I II I I I I II I I I I I I I II II II I I I I I II I I I I I II 
2701 AAGGTTAGTCCCTACCTCTTCAcTGTCCCAATTAAGGAAGCAGGCGAAGA 2750 

2764 CTGCCACGCCCCGACCTACCTGCCCGCCGAGGTGGACGGCGACGTGAAGC 2813 

I I I I I I I I I I I II I I I I I II II I I I I I I I I II II II II I 

27 51 CTGCCATGCCCCAACATACCTACCTGCGGAGGTGGATGGTGATGTCAAAC 2800 

2814 TGAGCAGCAACCTGGTGATCCTGCCCGGCCAGGACCTGCAGTACGTGCTG 2863 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
2801 TCAGTTCCAATCTGGTGATTCTACCTGGTCAAGATCTCCAATATGTTTTG 2850 

2864 GCCACCTACGACACCAGCCGCGTGGAGCACGCCGTGGTGTACTACGTGTA 2913 

II I I I I I I I I II I I Mil M II I I II I II II II I II 
2851 GCAACCTACGATACTTCCAGGGTTGAACATGCTGTGGTTTATTACGTTTA 2900 

2914 CAGCCCCGGCCGCAGCTTCTTCTACTTCTACCCCTTCCGCCTGCCCATCA 2963 

II I II I I I II II I I II II II I M II II I II II I I I 
2901 CAGCCCAgGCCGCTCATTTTtTTACTTTTATCCTTTTAGGTTGCCTATAA 2950 

2964 AGGGCGTGCCCATCGAGCTGCAGGTGGAGTGCTTCACCTGGGACCAGAAG 3013 

II II M I I II II II I II II I I I II II I II I M II II I I II 
2951 AGGGGGTCCCCATCGAATTACAAGTGGAATGCTTCACATGGGACCAAAAA 3000 

3014 CTGTGGTGCCGCCACTTCTGCGTGCTGGCCGACAGCGAGAGCGGCGGCCA 3063 

II II I I II II II II II I I II II I II III II II II II 

3001 CTCTGGTGCCGTCACTTCTGTGTGCTTGCGGACTCAGAATCTGGTGGACA 3050 

3064 CATCACCCACAGCGGCATGGTGGGCATGGGCGTGAGCTGCACCGTGACCC 3113 

I II II III II I II II I I II I I II I II I I II II II II II II 
3051 TATCACTCACTCTGGGATGGtGGGCATGGGAGTCAGCTGCACAGTCACCC 3100 

3114 GCGAGGACGGCACCAACCGCCGCTAG 3139 

I II I I II II I I I III I III 
3101 GGGAAGATGGAACCAATCGCAGATAG 3126 
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Fig. 428:?^!!^^ check: 4381 from: 1 to: 3126 
nucleic acid sequence of F^Hj^vCnon humanised) 

ATGAAGGCTTTTCCAGTTATTTGCTTGGGCTTTGCAATCTTTTCATCCTC 
TATATGTGTGAATATCAATATCTTGCAGCAAATTGGATACATCAAGCAAC 
AGGTCAGGCAACTAAGCTATTACTCACAAAGTTCAAGCTCCTACGTAGTG 
GTCAAGCTTTTACCGAATATCCAACCCACTGATAACAGCTGTGAATTTAA 
GAGTGTAACTCAATACAATAAGACCTTGAGTAATTTGCTTCTTCCAATTG 
CAGAAAACATAAACAATATTACGTCGCCCTCACCTGGGTCAAGACGTCAT 
AAACGGTTTGCTGGCATTGCCATTGGCATTGCgGCcCTCGGTGTTGCGAC 
CGCAGCACAAGTGACTGCCGCTGTCTCATTAGTTCAAGCACAGACAAATG 
CACGTGCAATAGCAGCGATGAAAAATTCAATACAGGCAACTAATCGGGCA 
GTCTTCGAAGTGAAGGAAGGCACCCAACAGTTAGCTATAGCGGTACAAGC 
cATcCAAGACCATATCAATACTATTATGAACACCCAATTGAACAATATGT 
CTTGTCAGATCCTTGATAACCAGCTTGCAACCTCCCTAGGATTATACCTA 
ACAGAATTAACAACAGTGTTTCAGCCACAATTAATTAATCCAGCATTGTC 
ACCGATTAGTATACAAGCCTTGAGGTCTTTGCTTGGAAGTATGACACCTG 
CAGTGGTTCAAGCAACATTATCTACTTCAATTTCTGCTGCTGAAATACTA 
AGTGCCGGTCTAATGGAGGGTCAGATAGTTTCTGTTCTGCTAGATGAGAT 
GCAGATGATAGTTAAGATAAACGTTCCAACCATTGTCACACAATCAAATG 
CATTGGTGATTGACTTCTACTCAATTTCGAGTTTTATTAATAATCAAGAA 
TCCATAATTCAATTGCCAGACAGGATCTTGGAGATCGGAAATGAACAATG 
GCGCTATCCAGCTAAGAATTGTAAGTTGACAAGACACCACATATTCTGCC 
AATACAATGAGGCAGAGAGGCTGAGCCTAGAAACAAAACTATGCCTTGCA 
GGCAATATTAGTGCCTGTGTGTTCTCATCTATAGCAGGGAGTTATATGAG 
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GCGATTTGTAGCACTGGATGGAACAATTGTTGCAAACTGTCGAAGTCTAA 
CGTGTCTATGCAAGAGTCCATCTTATCCTATATACCAACCTGACCATCAT 
GCAGTCACGACCATTGATCTAACGTCATGTCAAACATTGTCCCTGGACGG 
ACTGGATTTCAGCATTGTCTCGCTAAGCAACATCACTTACGCTGAGAATC 
TTACTATTTCATTGTCTCAGACAATCAATACTCAACCCATTGATATATCA 
ACTGAGCTGAGTAAGGTTAATGCATCCCTCCAAAATGCCGTTAAATACAT 
AAAAGAGAGTAACCATCAACTCCAATCCGTTAGTGTAAGTTCTAAAAGAC 
TTCATCGGGCAGCCATCTACACCGCAGAGATCCATAAAAGCCTCAGCACC 
AATCTAGATGTAACTAACTCAATCGAGCATCAGGTCAAGGACGTGCTGAC 
ACCACTCTTCAAAATCATCGGTGATGAAGTGGGCCTGAGGACACCTCAGA 
GATTCACTGACCTAGTGAAATTCATCTCTGACAAGATTAAATTCCTTAAT 
CCGGATAGGGAGTACGACTTCAGAGATCTCACTTGGTGTATCAACCCGCC 
AGAGAGAATCAAATTGGATTATGATCAATACTGTGCAGATGTGGCTGCTG 
AAGAGCTCATGAATGCATTGGTGAACTCAACTCTACTGGAGACCAGAACA 
ACCAATCAGTTCCTAGCTGTCTCAAAGGGAAACTGCTCAGGGCCCACTAC 
AATCAGAGGTCAATTCTCAAACATGTCGCTGTCCCTGTTAGACTTGTATT 
TAGGTCGAGGTTACAATGTGTCATCTATAGTCACTATGACATCCCAGGGA 
ATGTATGGGGGAACTTACCTAGTGGAAAAGCCTAATCTGAGCAGCAAAAG 
GTCAGAGTTGTCACAACTGAGCATGTACCGAGTGTTTGAAGTAGGTGTTA 
TCAGAAATCCGGGTTTGGGGGCTCCGGTGTTCCATATGACAAACTATCTT 
GAGCAACCAGTCAGTAATGATCTCAGCAACTGTATGGTGGCTTTGGGGGA 
GCTCAAACTCGCAGCCCTTTGTCACGGGGAAGATTCTATCACAATTCCCT 
ATCAGGGATCAGGGAAAGGTGTCAGCTTCCAGCTCGTCAAGCTAGGTGTC 
TGGAAATCCCCT^ACCGACATGCAATCCTGGGTCCCCTTATCAACGGATGA 
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TCCAGTGATAGACAGGCTTTACCTCTCATCTCACAGAGGTGTTATCGCTG 
ACAAcCAAGCAAAATGGGCTGTCCCGACAACACGAACAGATGACAAGTTG 
CGAATGGAGACATGCTTCCAACAGGCGTGTAAGGGTAAAATCCAAGCACT 
CTGCGAGAATCCCGAGTGGGCACCATTGAAGGATAACAGGATTCCTTCAT 
ACGGGGTCTTGTCTGTTGATCTGAGTCTGACAGTTGAGCTTAAAATCAAA 
ATTGCTTCGGGATTCGGGCCATTGATCACACACGGTTCAGGGATGGACCT 
ATACAAATCCAACCACAACAATGTGTATTGGCTGACTATCCCGCCAATGA 
AGAACCTAGCCTTAGGTGTAATCAACACATTGGAGTGGATACCGAGATTC 
AAGGTTAGTCCCTACCTCTTCAcTGTCCCAATTAAGGAAGCAGGCGAAGA 
CTGCCATGCCCCAACATACCTACCTGCGGAGGTGGATGGTGATGTCAAAC 
TCAGTTCCAATCTGGTGATTCTACCTGGTCAAGATCTCCAATATGTTTTG 
GCAACCTACGATACTTCCAGGGTTGAACATGCTGTGGTTTATTACGTTTA 
CAGCCCAgGCCGCTCATTTTtTTACTTTTATCCTTTTAGGTTGCCTATAA 
AG GGGGT C CC C AT C G AAT T AC AAG T GG AAT G CT T C AC AT G G GACCAAAAA 
CTCTGGTGCCGTCACTTCTGTGTGCTTGCGGACTCAGAATCTGGTGGACA 
TATCACTCACTCTGGGATGGtGGGCATGGGAGTCAGCTGCACAGTCACCC 
GGGAAGATGGAACCAATCGCAGATAG 
[SEQ ID NO: 97] 
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Fig 42C: F MUV humH M hum.seq check: 5778 from: 14 to: 3139 
Humanised nucleic acids sequence of F MuV H Mv 

ATGAAGGCGTTCCCCGTGATCTGCCTGGGCTTCGCCATCTTCTCCAGCAG 
CATCTGCGTGAACATCAACATCCTGCAGCAGATCGGATACATCAAGCAGC 
AGGTGAGGCAGCTGAGCTACTACTCCCAGAGCTCCAGCTCCTACGTGGTG 
GTCAAGCTGCTGCCCAACATCCAGCCCACCGACAACAGCTGCGAGTTCAA 
GAGCGTGACCCAGTACAACAAGACCCTGAGCAACCTGCTGCTGCCCATCG 
CCGAGAACATCAACAACATCACCTCCCCCTCCCCCGGCTCCCGGCGGCAC 
AAGCGGTTCGCCGGCATCGCCATCGGCATCGCCGCCCTGGGCGTGGCCAC 
CGCCGCCCAGGTGACCGCCGCCGTGTCCCTGGTGCAGGCCCAGACCAACG 
CCCGCGCCATCGCCGCCATGAAGAACTCCATCCAGGCCACCAACCGCGCC 
GTGTTCGAGGTGAAGGAGGGCACCCAGCAGCTGGCCATCGCCGTGCAGGC 
CATCCAGGACCACATCAACACCATCATGAACACCCAGCTGAACAACATGT 
CCTGCCAGATCCTGGACAACCAGCTGGCCACCTCCCTGGGCCTGTACCTG 
ACCGAGCTGACCACCGTGTTCCAGCCCCAGCTGATCAACCCCGCCCTGtc 
cCCCATCAGTATCCAGGCCCTGCGGTCCCTGCTGGGCAGCATGACCCCCG 
CCGTGGTGCAGGCCACCCTGAGCACCTCCATCAGCGCCGCCGAGATCCTG 
AGCGCCGGCCTGATGGAGGGCCAGATCGTGTCCGTGCTGCTGGACGAGAT 
GCAGATGATCGTGAAGATCAACGTGCCCACCATCGTGACCCAGTCCAACG 
CCCTGGTGATCGACTTCTACAGCATCAGCAGCTTCATCAACAACCAGGAG 
TCCATCATCCAGCTGCCCGACCGCATCCTGGAGATCGGCAACGAGCAGTG 
GCGCTACCCCGCCAAGAACTGCAAGCTGACCCGCCACCACATCTTCTGCC 
AGTACAACGAGGCCGAGCGGCTGAGCCTGGAGACCAAGCTGTGCCTGGCC 
GGCAACATCAGCGCCTGCGTGTTCTCCAGCATCGCCGGCAGCTACATGCG 
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CCGCTTCGTGGCCCTGGACGGCACCATCGTGGCCAACTGCCGCAGCCTGA 
CCTGCCTGTGCAAGAGCCCCTCCTACCCCATCTACCAGCCCGACCACCAC 
GCCGTGACCACCATCGACCTGACCTCCTGCCAGACCCTGAGCCTGGACGG 
CCTGGACTTCAGCATCGTGTCCCTGAGCAACATCACCTACGCCGAGAACC 
TGACCATCAGCCTGAGCCAGACCATCAACACCCAGCCCATCGACATCTCC 
ACCGAGCTGAGCAAGGTGAACGCCTCCCTGCAGAACGCCGTGAAGTACAT 
CAAGGAGAGCAACCACCAGCTGCAGAGCGTGAGCGTGAGCAGCAAGCGCC 
TGCACCGCGCCGCCATCTACACCGCCGAGATCCACAAGAGCCTGAGCACC 
AACCTGGACGTGACCAACTCCATCGAGCACCAGGTGAAGGACGTGCTGAC 
CCCCCTGTTCAAGATCATCGGCGACGAGGTGGGCCTGCGCACCCCCCAGC 
GCTTCACCGACCTGGTGAAGTTCATCTCCGACAAGATCAAGTTCCTGAAC 
CCCGACCGCGAGTACGACTTCCGCGACCTGACCTGGTGCATCAACCCCCC 
CGAGCGGATCAAGCTGGACTACGACCAGTACTGCGCCGACGTGGCCGCCG 
AGGAGCTGATGAACGCCCTGGTGAACAGCACCCTGCTGGAGACCCGCACC 
ACCAACCAGTTCCTGGCCGTGAGCAAGGGCAACTGCAGCGGCCCCACCAC 
CATCCGGGGCCAGTTCAGCAACATGAGCCTGTCCCTGCTGGACCTGTACC 
TGGGCCGGGGCTACAACGTGAGCAGCATCGTGACCATGACCAGCCAGGGC 
ATGTACGGCGGCACCTACCTGGTGGAGAAGCCCAACCTGAGCAGCAAGCG 
GAGCGAGCTGAGCCAGCTGAGCATGTACCGCGTGTTCGAGGTGGGCGTGA 
TCCGGAACCCCGGCCTGGGCGCCCCCGTGTTCCACATGACCAACTACCTG 
GAGCAGCCCGTGAGCAACGACCTGAGCAACTGCATGGTGGCCCTGGGCGA 
GCTGAAGCTGGCCGCCCTGTGCCACGGCGAGGACAGCATCACCATCCCCT 
ACCAGGGCAGCGGCAAGGGCGTGAGCTTCCAGCTGGTGAAGCTGGGCGTG 
TGGAAGAGCCCCACCGACATGCAGAGCTGGGTGCCCCTGAGCACCGACGA 
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CCCCGTGATCGACCGCCTGTACCTGAGCAGCCACCGCGGCGTGATCGCCG 
ACAACCAGGCCAAGTGGGCCGTGCCCACCACCCGCACCGACGACAAGCTG 
CGCATGGAGACCTGCTTCCAGCAGGCCTGCAAGGGCAAGATCCAGGCCCT 
GTGCGAGAACCCCGAaTGGGCCCCCCTGAAGGACAACCGCATCCCCAGCT 
ACGGCGTGCTGAGCGTGGACCTGAGCCTGACCGTGGAGCTGAAGATCAAG 
ATCGCGAGCGGCTTCGGCCCCCTGATCACCCACGGCAGCGGCATGGACCT 
GTACAAGAGCAACCACAACAACGTGTACTGGCTGACCATCCCCCCCATGA 
AGAACCTGGCCCTGGGCGTGATCAACACCCTGGAGTGGATtCCCCGCTTC 
AAGGTGAGCCCCTACCTGTTCACCGTGCCCATCAAGGAGGCCGGCGAGGA 
CTGCCACGCCCCGACCTACCTGCCCGCCGAGGTGGACGGCGACGTGAAGC 
TGAGCAGCAACCTGGTGATCCTGCCCGGCCAGGACCTGCAGTACGTGCTG 
GCCACCTACGACACCAGCCGCGTGGAGCACGCCGTGGTGTACTACGTGTA 
CAGCCCCGGCCGCAGCTTCTTCTACTTCTACCCCTTCCGCCTGCCCATCA 
AGGGCGTGCCCATCGAGCTGCAGGTGGAGTGCTTCACCTGGGACCAGAAG 
CTGTGGTGCCGCCACTTCTGCGTGCTGGCCGACAGCGAGAGCGGCGGCCA 
CATCACCCACAGCGGCATGGTGGGCATGGGCGTGAGCTGCACCGTGACCC 
GCGAGGACGGCACCAACCGCCGCTAG 
[SEQ ID NO: 98] 
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the FHN protein 
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Fig. 44 IgGl, directed against 
the FHN protein. 
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